An open API service indexing awesome lists of open source software.

https://github.com/digitallinguistics/dft

Discourse Functional Transcription
https://github.com/digitallinguistics/dft

corpora corpus corpus-data corpus-linguistics data-format digital-humanities digital-linguistics discourse dlx functionalism language linguistics transcription

Last synced: about 1 month ago
JSON representation

Discourse Functional Transcription

Awesome Lists containing this project

README

        

# Discourse Functional Transcription (DFT)

_Discourse Functional Transcription_ (DFT) is a system for transcribing natural language discourse developed by [John W. DuBois][1] (Department of Linguistics, University of California, Santa Barbara). It consists of two components:

* a data format for representing transcripts in human- and computer-readable form

* a set of transcription conventions for representing various aspects of speech and its context

This repository contains specifications for formatting data in DFT, and the set of transcription conventions it uses.

## Versioning

DFT is the successor to two earlier versions of this system—DT1 and DT2 (where DT = Discourse Transcription).

This repository contains specifications for formatting data in the DT1, DT2, and DFT systems, and the set of transcription conventions used by each. It uses a form of [semantic versioning][2] to track changes to the DFT specification, where DT1 is considered v1.0, DT2 is v2.0, and DFT is v3.0. Each new version release may be viewed on the [releases page][3].

System | Version
------ | -----------
DT1 | v1.0
DT2 | v2.0
DFT | v3.0+

For more information on DT1 and DT2, see the following sources:

* [John W. Du Bois, Susanna Cumming, Stephan Schuetze-Coburn & Danae Paolino (eds.). 1992. _Discourse transcription_. (Santa Barbara Papers in Linguistics 4).][4]

* [John W. Du Bois, Stephan Schuetze-Coburn, Susanna Cumming, & Danae Paolino. 1993. Outline of discourse transcription. In Jane A. Edwards & Martin D. Lampert (eds.), _Talking data: Transcription and coding in discourse research_, 45–89. Lawrence Erlbaum.][6]

* [John W. Du Bois. 2018. _Representing Discourse_. Manuscript.][5]

## Citations & Attributions

This project uses [Zenodo][10] to publish the code in this repository with a citable Digital Object Identifier (DOI). Click the DOI link below to cite this repository.

To cite the latest version of the data format specifications in this repository, you may use the following bibliographic model:

> John W. DuBois & Daniel W. Hieber. (2017, December 30). digitallinguistics/DFT. Zenodo. https://doi.org/10.5281/zenodo.1134007

You can also cite specific versions of the specification (if you want to refer to the DT1 format, for instance), by selecting the version on Zenodo and copying its citation:

[![DOI](https://zenodo.org/badge/DOI/10.5281/zenodo.1134008.svg)](https://doi.org/10.5281/zenodo.1134008)

To cite data from the Santa Barbara Corpus (SBC), [use the citation guidelines found here][9].

## Issues

If you see any issues in the specifications, or have any questions, please [open an issue][7].

## License

Please see the [license for this repository][8] to view the licenses for different parts of this project.

[1]: http://www.linguistics.ucsb.edu/people/john-w-du-bois
[2]: https://semver.org
[3]: https://github.com/digitallinguistics/DFT/releases
[4]: http://www.linguistics.ucsb.edu/research/santa-barbara-papers#Volume4
[5]: http://www.linguistics.ucsb.edu/projects/transcription/representing
[6]: https://www.amazon.com/Talking-Data-Transcription-Discourse-Research/dp/0805803491/ref=sr_1_1?ie=UTF8&qid=1514610009&sr=8-1&keywords=talking+data+discourse
[7]: https://github.com/digitallinguistics/DFT/issues
[8]: https://github.com/digitallinguistics/DFT/blob/master/LICENSE.md
[9]: http://www.linguistics.ucsb.edu/research/santa-barbara-corpus#Citing
[10]: https://zenodo.org