INFO Features




 Overview

  • Extends the INFO Productivity Language to include free text Information Retrieval.
  • Allows the creation of databases and complete applications which require a mix of structured and free text data items.
  • Gives users the ability to enter, update, interrogate and report on both types of data with the same command language.
  • Preserves the tabular view of Relational databases while allowing any fields to contain unlimited amounts of formated text.
  • This introduction is designed to provide a description of the additional facilities which INFO-Text provides to the INFO user and should be read as a sequel to the introduction to INFO.
  • More detailed information can be obtained from the INFO Reference Manual which describes the formats, functions and options available to each of INFO-Text's commands.




 Background

The 1970s saw the development of two fundamentally different approaches to data management which can be described as structured and free text. The structured methods are powerful for dealing with highly complex relationships between data elements and are appropriate for use in bill of material applications and many others. Free Text systems, on the other hand, were designed to tackle legal and, later on, library automation tasks where the data has a less rigid structure. The free text systems provide access to the stored information through an inverted list of pointers to all significant 'words' in the text body.

Like the more structured DBMS products, free text systems support query languages for data access but the differences are tremendous. While DBMS queries are particularly strong on value searches (i.e. FIND UNPAID INVOICES OVER 1,000 AND INVOICE-DATE GREATER THAN 1/1/96), free text query languages can efficiently support word, phrase, synonym and proximity searches on even large amounts of more 'natural language' text data (i.e. FIND MICROPROCESSOR WITHIN THE SAME PARAGRAPH AS 'AERONAUTICAL NAVIGATION').

Unfortunately, many important applications require the strengths of both DBMS and free text systems to be successful and it has thus been a dream of those who have used both to see them integrated into a single environment.

INFO-Text brings the two together and enables applications to store, retrieve, update and report on any mix of structured and free text data with the same command language.



 INFO-Text's Extensions to INFO

INFO-Text offers all of the facilities of INFO, a well established productivity language for relational data management, personal computing and Rapid Application Development. In addition, INFO-Text delivers new power in the following features:

Existing File Compatability

INFO-Text extends the INFO tradition of working with existing files to include any text in ASCII form created by external editors or word processors. This feature makes possible a new range of important applications such as correspondence tracking systems, contracts administration and the linking of application source code with specifications and documentation.

Concordance

INFO-Text provides a full range of user-controlled concordance options which enable INFO's query commands (RESELECT and ASELECT) to also locate occurrences of words, parts of words, phrases and synonyms. Proximity searches with Boolean conditions and wild cards are easily performed.

Dictionary

INFO-Text maintains a complete dictionary of concorded words for each database which allows control of stopwords and is used to test and document search strategies.

Full Screen Editor

INFO-Text includes a VDU independent, full screen editor which is automatically invoked whenever text items are addressed with the ADD, UPDATE or INPUT FORM commands.

Mixed Retrieval

INFO-Text's retrieval commands (LIST, DISPLAY and PRINT) have been extended to support any mix of fixed width and free text data with new controls for text column widths, tab positions and line or page spacing. As with INFO, data can be retrieved from a selected and up to 9 RELATEd files at any time.

Relational Model

The data model preserves the tabular user view and flexibility of Relational data management while allowing any attributes or columns to contain free text.

The Data Model

INFO-Text databases consist of any number of flat tables or files which can be dynamically joined through common data items with the RELATE command.

Free text data is held separately and accessed through pointers which are either generated by INFO-Text itself or are specified by the user. It is this second option which enables INFO-Text to access text data which has been created externally. As the individual records need to contain only the pointer information to the variable length text they are physically stored in a fixed width format which has been defined by the user.