https://github.com/altoxml/schema
ALTO XML schema - latest and all former versions
https://github.com/altoxml/schema
alto alto-xml alto-xml-schema ocr optical-character-recognition schema
Last synced: 3 months ago
JSON representation
ALTO XML schema - latest and all former versions
- Host: GitHub
- URL: https://github.com/altoxml/schema
- Owner: altoxml
- Created: 2013-11-18T15:40:58.000Z (over 11 years ago)
- Default Branch: master
- Last Pushed: 2024-07-16T15:11:19.000Z (11 months ago)
- Last Synced: 2024-07-31T21:54:16.623Z (10 months ago)
- Topics: alto, alto-xml, alto-xml-schema, ocr, optical-character-recognition, schema
- Homepage:
- Size: 5.85 MB
- Stars: 51
- Watchers: 19
- Forks: 4
- Open Issues: 25
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- awesome-ocr - ALTO XML Schema - XML Schema and development of the ALTO XML format (Software / OCR file formats)
README
## [ALTO XML schema](https://github.com/altoxml/schema/wiki)
This repository contains ALTO schema versions - drafts and final released ones.All open issues and discussions about changes to the ALTO standard can be found and tracked in the [issues](https://github.com/altoxml/schema/issues) repository
Latest official schema version is 4.4.
Primary source for the schema is (http://www.loc.gov/standards/alto/v4/alto-4-4.xsd)
Alternate source for the schema is (https://cdn.rawgit.com/altoxml/schema/master/v4/alto-4-4.xsd)Summary of proposed changes
* Change schema version to 4.4
* Add LANG attribute on PageType level to describe the default language used in document
* Add ROTATION attribute on PageType level to describe the default rotation used in document
* Add OTHERLANGS attribute on PageType to summarize all the languages present into a particular document
* Adapt "PointsType" documentation
* Adapt xLink attribute group documentation on "BlockType"Details about the changes of the version and further documentation can be found in the ALTO
[documentation](https://github.com/altoxml/documentation/wiki) repository.