Kirtas Technologies Glossary of Terms
Automatic Page Flattening
KABIS utilizes jets of air to flatten pages prior to virtually eliminate curvature in most books.
Dublin Core
The Dublin Core set of metadata elements provides a small and fundamental group of text elements through which most resources can be described and catalogued. Using only 15 base text fields, a Dublin Core metadata record can describe physical resources such as books, digital materials such as video, sound, image, or text files, and composite media like web pages. Metadata records based on Dublin Core are intended to be used for cross-domain information resource description and have become standard in the fields of library science and computer science. Implementations of Dublin Core typically make use of XML and are Resource Description Framework based.
JPG 2000
JPEG 2000 is a new image coding system that uses state-of-the-art compression techniques based on wavelet technology.
MARC
The MARC formats are standards for the representation and communication of bibliographic and related information in machine-readable form.
MetaData
Metadata describes how and when and by whom a particular set of data was collected, and how the data is formatted. Metadata is essential for understanding information stored in data warehouses and has become increasingly important in XML-based Web applications.
METS/ALTO
An XML schema published by the Library of Congress, and being used by the US National Digital Newspaper Project (NDNP), as well as many other newspaper digitization projects (as well as some collections of books, journals, and other textual resources). In addition to extracting machine-readable text from the page a process resulting in METS/ALTO also records information about individual articles within a page. This allows a user interface to be built where books and newspaper articles can be displayed on their own, as well as within the pages on which they were printed.
MODS (Metadata Object Description Schema)
Metadata Object Description Schema (MODS) is a schema for a bibliographic element set that may be used for a variety of purposes, and particularly for library applications. The standard is maintained by the Network Development and MARC Standards Office of the Library of Congress with input from users.
Page Edge Sensor Technology
Sensor detects multi-page or no-page lift conditions and automatically take programmable corrective action. The result is full book content integrity and quality.
Patented SmartCradle™ Design
The KABIS family of automatic book scanners with its self-centering SmartCradle technology provides truly automatic book digitization. Self-centering keeps the book centered throughout the page-turning process, eliminating operator intervention and decreasing post processing activity. The Kirtas SmartCradle gently cradles the book at 110 degrees, optimal for low stress digitization of even rare and fragile books.
Sakhr OCR
Converts scans of Arabic printed documents into digital text. Sakhr is rated #1 in recognizing clean copy Arabic text, with an output accuracy of 99%. Sakhr is the leading OCR provider for the Middle East, U.S., and Europe security and business needs.
Semantic Relevance
Semantic relevance of a feature for a given concept can be understood as a combination of two different components: 1) a local component which stands for the dominance of the attribute for that concept 2) a global component which represents the overall distinctiveness of the semantic feature in a targeted semantic domain. Intuitively, an attribute has high dominance for a concept when it is frequently mentioned in defining the concept; whereas an attribute has high distinctiveness when it is used in defining few concepts. Therefore, SR scores high when a semantic feature is both frequently mentioned in defining a concept, but only mentioned in defining few other concepts.
SmartCradle™ Automatic Cradle Centering
Kirtas systems includes the Kirtas SmartCradle which automatically keeps the book centered during the digitization process accounting for the varying book thickness, as the pages move from one side to the other during scanning, drastically reducing post processing.
Soundex
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.[1] The algorithm mainly encodes consonants; a vowel will not be encoded unless it is the first letter. Soundex is the most widely known of all phonetic algorithms, as it is a standard feature of MS SQL and Oracle, and is often used (incorrectly) as a synonym for "phonetic algorithm". Improvements to Soundex are the basis for many modern phonetic algorithms.
SureTurn™ Robotic Page-turning Arm
SureTurn robotic arm with its vacuum head will gently lift and turn the page. Within the vacuum head is our Page Edge Sensor technology to detect multi-page or no-page lift conditions and automatically take programmable corrective action. The result is full book content integrity and quality.
UNIMARC
UNIMARC designates (tags, indicators and sub field codes) to be assigned to bibliographic records in machine-readable form and to specify the logical and physical format of the records. It covers monographs, serials, cartographic materials, music, sound recordings, graphics, projected and video materials, rare books and electronic resources.
Z39.50
Z39.50 is a client–server protocol for searching and retrieving information from remote computer databases. It is covered by ANSI/NISO standard Z39.50, and ISO standard 23950. The standard's maintenance agency is the Library of Congress.
Z39.50 is widely used in library environments and is often incorporated into integrated library systems and personal bibliographic reference software. Interlibrary catalogue searches for interlibrary loan are often implemented with Z39.50 queries.
Work on the Z39.50 protocol began in the 1970s, and led to successive versions in 1988, 1992, 1995 and 2003. The Common Query Language is based on Z39.50 semantics.
|