An open API service indexing awesome lists of open source software.

https://github.com/altoxml/schema

ALTO XML schema - latest and all former versions
https://github.com/altoxml/schema

alto alto-xml alto-xml-schema ocr optical-character-recognition schema

Last synced: 3 months ago
JSON representation

ALTO XML schema - latest and all former versions

Awesome Lists containing this project

README

        

## [ALTO XML schema](https://github.com/altoxml/schema/wiki)
This repository contains ALTO schema versions - drafts and final released ones.

All open issues and discussions about changes to the ALTO standard can be found and tracked in the [issues](https://github.com/altoxml/schema/issues) repository

Latest official schema version is 4.4.

Primary source for the schema is (http://www.loc.gov/standards/alto/v4/alto-4-4.xsd)

Alternate source for the schema is (https://cdn.rawgit.com/altoxml/schema/master/v4/alto-4-4.xsd)

Summary of proposed changes

* Change schema version to 4.4
* Add LANG attribute on PageType level to describe the default language used in document
* Add ROTATION attribute on PageType level to describe the default rotation used in document
* Add OTHERLANGS attribute on PageType to summarize all the languages present into a particular document
* Adapt "PointsType" documentation
* Adapt xLink attribute group documentation on "BlockType"

Details about the changes of the version and further documentation can be found in the ALTO
[documentation](https://github.com/altoxml/documentation/wiki) repository.