Text Engine

The DAF Platform is designed to easily integrate with search engines using their APIs allowing us to deliver the latest and most advanced knowledge retrieval technology for indexing and searching a wide range of distributed information sources.

The Text Search Engine supported will process hundreads of document types stored on file servers, in GroupWare systems, relational databases, document management systems, Intranets, and the Internet. Today's technology excels in distributed client/server environments and scales to large numbers of digital assets and users.

We support users in a distributed architecture using the sophisticated search engine infrastructure to simultaneously access digital assets in multiple repositories. Depending on the search engine used, users can perform Concept, Pattern, Soundex or Boolean searches over all configured repositories.

The Text Search Engines currently supported are:

Concept Searching    
 Apache Solr    

 

The Text Search Engines have a client server and SOA compliant architectures. The architecture is made up of the following basic code components:

  • Client Handler - Handles client requests for queries (which are then passed to the search servers), for document text (which may extracted, filtered, and reformatted before being returned to the user), and document meta-data information.
  • Search Servers - These are the servers that actually perform the search. They can handle multiple simultaneous queries (multi-threaded) and search over multiple repositories of information. Search servers may be clustered to handle larger databases or larger user loads.
  • Highly scalable - DAF's Text Engine is highly scalable to meet customer requirements. As the data repositories grow so will the performance. At least one query server is required on all DAF systems to actually execute the search requested by the client. Each query server can execute queries over multiple repositories, and each server can handle multiple queries simultaneously (multi-threaded). Multiple query servers return merged results and presents them to one client. This architecture allows you flexibility in distributing tasks across multiple query servers.