Wednesday, August 6, 2008

Rich UI framework

Everyone has embark into Web 2.0 era. With the aid of available framework, developer can easily enhance web application with rich user-interface. Rich UI is achieved by Ajax functionalities. ExtJs, icefaces, google GWT are the common and consist of relatively full suite of powerful component which speed up the development of web application. ExtJs make up of javascript codes which allow developer to describe the UI "swing style" in javascript. With ExtJs framework, UI and back-end are clearly separated. It means that client browser will do the render of UI and the server does the pure data processing. For that, UI can consider as a skin for the web application and can be modified without changes in the back-end code. For team development, design and programming task can be divide and not dependent of one another.
Icefaces JSF framework allow programmer to create template in Jsf format. And after compilation, javascript and html are automatically created for the web application. It it easier for JSF developer to integrate icefaces. However the cons are control of the html Dom component is limited. Quite tedious for developer who required to customize the html. Google toolkit GWT allow developer to create UI using "swing" java codes. Upon compile, javascript and html will be generated automatically. Since the UI is described in java, java features can be utilized and more robust. But the trade off is the compilation take time. Every change in UI required a re-compile.

Thursday, June 5, 2008

Normalize Search ranking for documentum and lucene

Documentum come with its own ranking formula. Administrator has no way to fine tune the configuration to integrate with other open source search engine. Rank normalization can be achieve through user feedback. Simply user themselves vote and assert priority on the individual search engine. How to determine the voting? For every search result the user and selected, the respective search engine vote will plus one. therefore the next search query, The higher vote search engine get to display more result as compare to the remaining search engine. However there always a fixed threshold to ensure at least each search engine got a chance to display some result for example 5 articles.

Simplest approach - set a fixed number of results for each search engine. Equal priority is assign to each search engine. Total result = Set( top 5 elements from SE01 ..... SE0X )

Captiva InputAccel Server

Server is configured as a window service in MS operating system which is suitable for administrator to manage the production service. Error or reporting log can bind to NT event logs or external text file.

User management is mapped to the NT user account. Indirectly the user account for IA service is manage via the NT users manager. However the security rights and roles are configured in IA server itself.

Maintenance and house-keeping task can be automated via the timer setup. For Example, remove all non-active batches after 2 weeks.

Monday, June 2, 2008

EMC Captiva InputAccel

Trying to archive huge amount of hardcopy document?

Beside the basic scanning process to convert hardcopy to digital format to ease backup of information. InputAccel provides additional OCR, form validation, data extraction and export feature.

Validation - requires operator to validate the required fields and provide the correct inputs. Possible to integrate validation Dll file to communicate to the server, or validate via a txt file. For Example, the fields value can be auto-populated via a odbc connection and select by specifying the SQL query where clause. Operator can confirm the populated value with the image and perform a final verification. The setup is quite flexible as if the record is not found within the database, insertion will be perform else update of record.

Data extraction - Allow user to pre-defined the area which hold the required data. Ability to extract data from barcode.

Image Enhancement - Similar to photoshop tool which improve the quality of the image. IE is pretty impressive, its consist of a full suite of features to fine tune the scanned images. Tools such as remove border, skewing, noise removal. With such filters, the processed images would be in a good condition for OCR or export into pdf. Two main set of filters are categorized into black/white and color.

Export - documentum, commonly used databases, xml etc.

Sunday, June 1, 2008

Documentum Search

Documentum come with a index search component. The search engine consist of 2 parts. The Index server and indexing agent. Upon inserting of new record into documentum content server. the file would be automatic append to the indexing queue. And depend on the file size, 5-10 minutes is required to index the new file and ready to be search. The support file types can be indexed are txt, (S)pdf, doc... etc. With documentum search can include structure/unstructure data. Advance search such as phrase, partial searching can be supported.

EMC documentum

Data is not simple anymore. Data consist of text, pictures and mutlimedia files. Technically is divided into structure/unstructure data. The existing database can handle structure data well. However for unstructure data is more complex and usually bigger in file size which is not efficient to be store in traditional database. Documentum separate the storage of structure and unstructure data. unstructure Data will be stored in its original state and reside in a file system repository. As for structure data will be place into popular RDBMS. Documentum provide a intermediate interface which help to retrieve and store structure/unstructure data efficient. Therefore the complexity is reduced. Developer can use documentum API to retrieve data. The management of unstructure/structure data is handle completely by documentum itself.

Documentum describe file as an object. OOP concept. Which is like object mapped to relational database. Documentum also have their query language which is similar to sql statements.

ExtJS web uploading tool

extjs framework with ajax allows remote file management just like windows explorer. Loading progress bar and transfer rate are shown to ensure effective response to user. Extjs also provide batch uploading feature.

With the Tree Panel structure, navigation of file directory is fast and look ahead search feature can be added.

One of the advantages of Ajax is asynchronisation allows agent to perform other task while the uploading still in process. And upon successful uploaded, proper alert message will be trigger to notify agent of the upload status. Therefore the web application itself can be more efficient and allow multi-tasking.

Saturday, May 31, 2008

EXTJS

Javascript framework to build java Swing look alike web application which provide rich UI and seamless functionalities with ajax. Extjs is suitable to develop database type application and it allows rapid design of prototype to confirm the UI in the early stage of project development. Extjs has useful front-end field validation feature to check for blank value, text length, support regular expression and remote validation. All these feature can be easily enable by simple configuration. Remote validation is useful to check for duplicated information via the backend database.
Another extjs goodies is the grid panel. The grid panel does support rearranging of column, filtering tool, sorting of column and pagination. The grid panel is perfect for tabulating data for enterprise applcation. User can easily summaries the data based on their own rules. Similar to MS Excel functionality.
It is also easily for traditional website to adopt this ExtJs framework. Extjs component can be render into html div tag. Therefore existing html template can be retain and assign a delicated layer to display extjs component. Extjs component appearance depend on css. The color theme can be edited to suit the existing website UI .