Access Innovations Announces New Series of Enhancements for Data Harmony Suite For MarkLogic Server Users

Albuquerque, NM, April 22, 2010 —Access Innovations, a leader in semantic enrichment and taxonomy development, today announced a new series of enhancements to their Data Harmony suite of content enrichment and thesaurus management tools specifically targeted to publishers and enterprises using MarkLogic Server. By creating and integrating subject metadata based on a taxonomy or controlled vocabulary, the Data Harmony tools add value to content in a number of ways:

The Data Harmony suite of thesaurus management and content management tools is the result of 31 years of experience continually working to develop ways to make the various parts of that process more efficient and more accurate. A major breakthrough has been the development of M.A.I.™ – the Machine Aided Indexer, which makes it possible for human categorizers (“indexers”) to increase their efficiency and consistency while adding superior descriptive data, or by automating the indexing process entirely. Customers have experienced up to a seven-fold increase in productivity using M.A.I. while measurably improving consistency and coverage of individual records.

Access Innovations extends the capabilities of MarkLogic Server and powers enhanced user experiences through these Data Harmony products:

Data Harmony Metadata Extractor tags the metadata elements within documents, adds subject terms, and creates well formed XML from PDFs, MS Office files, and other formats. By creating XML from raw source data, and making it ready for load into the MarkLogic XML Repository, Access Innovations assists publishers to get the full benefit of their investment in MarkLogic Server.

Data Harmony Search Harmony is a user interface/presentation layer which rides on top of MarkLogic Query (as well as SQL, Lucene and others.) Among the features this offers:

Data Harmony Inline Tagging finds and tags thesaurus terms and concepts (identified by rules stored within the thesaurus rule base) within the full text of the article XML or in the PDFs. This inline tagging may be viewed in XML, converted to HTML style sheets for bolding or mouse-overs, or used in displayed search results, enabling users to pinpoint the exact point in the article where the searched-for concept is mentioned. A statistical summary, showing a list of terms and the number of times each term is mentioned in the article, is also generated.