NavigationUser login |
ObjectsObjects beyond textIB is unique among full-text systems in that it also provides numerous object types with their own methods of search. Native Object Types
The above types have proved to be quite flexible and to solve many IR problems. The type "hash", for example, can be used to locate known images across XML/SGML documents. Metaphone addresses, on the other hand, the typical need to handle spelling variations of names. Its based upon a variation of Lawrence Phillips’ Double Metaphone phonetic matching algorithm. It groups words not just by spellings, but also by their potential variations of pronunciation. Soundex, although, for the most part obsoleted by metaphone, still has some uses and there are cases, albeit rarer, where its use is preferred (sometimes incorrect matches can produce interesting serendipities). "External" Object typesIn addition IB provides extended distributed objects via interface glue into other systems via ODBC, CORBA or object embedding. This allows indexing content--- for example from RSS/XML--- to be stored in and searched from other systems. This is useful in many dynamic applications in commerce and trading (keeping live counts of goods on hand, selling prices, etc.).Standard interface modules are provided for ODBC and platform native Berkeley DB (resp. GDB).
AutodetectionVarious doctypes can automatically (if enabled, resp. not disabled) at index time detect a number of field datatypes:
Inline definitionObjects are typically defined, resp. managed, via configuration (.ini files) but may for a XMLish record format (GILSXML) be defined in place as Type (alternatively via configuration another word can be set). Example: <AGE Type="numerical">12</AGE> defines (or adds) the type of the field AGE as numerical. <record>
<person>
<uid Type="numerical">123457</uid>
<name>
<last Type="metaphone">Zimmermann</last>
<first>Edward</first>
</name>
<company>
<name>NONMONOTONIC Lab</name>
<address>
....
</address>
</record>In the above the XML path record/person/name/last would be defined as of type "metaphone" while record/person/company/name would be of standard string (textual) type. By Edward C. Zimmermann at 2010-05-03 08:21 index objects | add new comment
|