{"id":22025883,"url":"https://github.com/digitallinguistics/dft","last_synced_at":"2026-01-05T06:16:32.746Z","repository":{"id":75391631,"uuid":"115771708","full_name":"digitallinguistics/DFT","owner":"digitallinguistics","description":"Discourse Functional Transcription","archived":false,"fork":false,"pushed_at":"2020-03-14T19:11:53.000Z","size":24,"stargazers_count":2,"open_issues_count":10,"forks_count":1,"subscribers_count":2,"default_branch":"master","last_synced_at":"2025-01-28T17:24:10.348Z","etag":null,"topics":["corpora","corpus","corpus-data","corpus-linguistics","data-format","digital-humanities","digital-linguistics","discourse","dlx","functionalism","language","linguistics","transcription"],"latest_commit_sha":null,"homepage":"https://github.com/digitallinguistics/DFT","language":null,"has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"other","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/digitallinguistics.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE.md","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2017-12-30T03:44:45.000Z","updated_at":"2023-11-05T19:19:18.000Z","dependencies_parsed_at":"2023-03-03T02:46:23.588Z","dependency_job_id":null,"html_url":"https://github.com/digitallinguistics/DFT","commit_stats":null,"previous_names":[],"tags_count":1,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/digitallinguistics%2FDFT","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/digitallinguistics%2FDFT/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/digitallinguistics%2FDFT/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/digitallinguistics%2FDFT/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/digitallinguistics","download_url":"https://codeload.github.com/digitallinguistics/DFT/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":245090876,"owners_count":20559298,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["corpora","corpus","corpus-data","corpus-linguistics","data-format","digital-humanities","digital-linguistics","discourse","dlx","functionalism","language","linguistics","transcription"],"created_at":"2024-11-30T07:20:23.878Z","updated_at":"2026-01-05T06:16:32.719Z","avatar_url":"https://github.com/digitallinguistics.png","language":null,"funding_links":[],"categories":[],"sub_categories":[],"readme":"# Discourse Functional Transcription (DFT)\n\n_Discourse Functional Transcription_ (DFT) is a system for transcribing natural language discourse developed by [John W. DuBois][1] (Department of Linguistics, University of California, Santa Barbara). It consists of two components:\n\n* a data format for representing transcripts in human- and computer-readable form\n\n* a set of transcription conventions for representing various aspects of speech and its context\n\nThis repository contains specifications for formatting data in DFT, and the set of transcription conventions it uses.\n\n## Versioning\n\nDFT is the successor to two earlier versions of this system—DT1 and DT2 (where DT = Discourse Transcription).\n\nThis repository contains specifications for formatting data in the DT1, DT2, and DFT systems, and the set of transcription conventions used by each. It uses a form of [semantic versioning][2] to track changes to the DFT specification, where DT1 is considered v1.0, DT2 is v2.0, and DFT is v3.0. Each new version release may be viewed on the [releases page][3].\n\nSystem | Version\n------ | -----------\nDT1    | v1.0\nDT2    | v2.0\nDFT    | v3.0+\n\nFor more information on DT1 and DT2, see the following sources:\n\n* [John W. Du Bois, Susanna Cumming, Stephan Schuetze-Coburn \u0026 Danae Paolino (eds.). 1992. _Discourse transcription_. (Santa Barbara Papers in Linguistics 4).][4]\n\n* [John W. Du Bois, Stephan Schuetze-Coburn, Susanna Cumming, \u0026 Danae Paolino. 1993. Outline of discourse transcription. In Jane A. Edwards \u0026 Martin D. Lampert (eds.), _Talking data: Transcription and coding in discourse research_, 45–89. Lawrence Erlbaum.][6]\n\n* [John W. Du Bois. 2018. _Representing Discourse_. Manuscript.][5]\n\n## Citations \u0026 Attributions\n\nThis project uses [Zenodo][10] to publish the code in this repository with a citable Digital Object Identifier (DOI). Click the DOI link below to cite this repository.\n\nTo cite the latest version of the data format specifications in this repository, you may use the following bibliographic model:\n\n\u003e John W. DuBois \u0026 Daniel W. Hieber. (2017, December 30). digitallinguistics/DFT. Zenodo. https://doi.org/10.5281/zenodo.1134007\n\nYou can also cite specific versions of the specification (if you want to refer to the DT1 format, for instance), by selecting the version on Zenodo and copying its citation:\n\n[![DOI](https://zenodo.org/badge/DOI/10.5281/zenodo.1134008.svg)](https://doi.org/10.5281/zenodo.1134008)\n\nTo cite data from the Santa Barbara Corpus (SBC), [use the citation guidelines found here][9].\n\n## Issues\n\nIf you see any issues in the specifications, or have any questions, please [open an issue][7].\n\n## License\n\nPlease see the [license for this repository][8] to view the licenses for different parts of this project.\n\n[1]: http://www.linguistics.ucsb.edu/people/john-w-du-bois\n[2]: https://semver.org\n[3]: https://github.com/digitallinguistics/DFT/releases\n[4]: http://www.linguistics.ucsb.edu/research/santa-barbara-papers#Volume4\n[5]: http://www.linguistics.ucsb.edu/projects/transcription/representing\n[6]: https://www.amazon.com/Talking-Data-Transcription-Discourse-Research/dp/0805803491/ref=sr_1_1?ie=UTF8\u0026qid=1514610009\u0026sr=8-1\u0026keywords=talking+data+discourse\n[7]: https://github.com/digitallinguistics/DFT/issues\n[8]: https://github.com/digitallinguistics/DFT/blob/master/LICENSE.md\n[9]: http://www.linguistics.ucsb.edu/research/santa-barbara-corpus#Citing\n[10]: https://zenodo.org\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdigitallinguistics%2Fdft","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fdigitallinguistics%2Fdft","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdigitallinguistics%2Fdft/lists"}