{"id":18557689,"url":"https://github.com/aosingh/triplesextractor","last_synced_at":"2025-07-06T00:33:43.150Z","repository":{"id":98969797,"uuid":"65349322","full_name":"aosingh/TriplesExtractor","owner":"aosingh","description":null,"archived":false,"fork":false,"pushed_at":"2016-08-14T21:23:31.000Z","size":45,"stargazers_count":1,"open_issues_count":0,"forks_count":0,"subscribers_count":2,"default_branch":"master","last_synced_at":"2025-02-17T11:43:58.890Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/aosingh.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2016-08-10T04:00:39.000Z","updated_at":"2022-12-16T05:07:54.000Z","dependencies_parsed_at":"2023-04-16T08:46:57.565Z","dependency_job_id":null,"html_url":"https://github.com/aosingh/TriplesExtractor","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/aosingh%2FTriplesExtractor","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/aosingh%2FTriplesExtractor/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/aosingh%2FTriplesExtractor/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/aosingh%2FTriplesExtractor/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/aosingh","download_url":"https://codeload.github.com/aosingh/TriplesExtractor/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":254355410,"owners_count":22057356,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-11-06T21:37:44.917Z","updated_at":"2025-05-15T14:11:29.357Z","avatar_url":"https://github.com/aosingh.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# TriplesExtractor\nTriplesExtractor is a Python package, which can be used to extract `\u003cSubject, Predicate, Object\u003e` triples from a document.\n\n## Getting Started\nThese instructions will give you a copy of the project up and running on your local machine for development and testing purposes.\n\n### Clone this Repo\nClone this repository into whatever directory you'd like to work on it from:\n\n```bash\ngit clone https://github.com/aosingh/TriplesExtractor.git\n```\n\n### Install the following\n*   [Python 2.7](https://www.python.org/download/releases/2.7/)\n*   [Syntaxnet](https://github.com/tensorflow/models/tree/master/syntaxnet) \n    * python 2.7:\n    * python 3 support is not available yet\n    *   bazel:\n        *   **versions 0.2.0 - 0.2.2b, NOT 0.2.3**\n        *   follow the instructions [here](http://bazel.io/docs/install.html)\n    *   swig:\n        *   `apt-get install swig` on Ubuntu\n        *   `brew install swig` on OSX\n    *   protocol buffers, with a version supported by TensorFlow:\n    *   check your protobuf version with `pip freeze | grep protobuf`\n    *   upgrade to a supported version with `pip install -U protobuf==3.0.0b2`\n    *   asciitree, to draw parse trees on the console for the demo:\n        *   `pip install asciitree`\n    *   numpy, package for scientific computing:\n        *   `pip install numpy`\n    Once you completed the above steps, you can build and test SyntaxNet with the\n    following commands:\n    ```shell\n      git clone --recursive https://github.com/tensorflow/models.git\n      cd models/syntaxnet/tensorflow\n      ./configure\n      cd ..\n      bazel test syntaxnet/... util/utf8/...\n      # On Mac, run the following:\n      bazel test --linkopt=-headerpad_max_install_names \\\n        syntaxnet/... util/utf8/...\n    ```\n    Bazel should complete reporting all tests passed.\n\n*   [nltk](http://www.nltk.org/)  \n*   [nltk.data](http://www.nltk.org/data.html)\n*   [networkx](https://networkx.readthedocs.io/en/stable/download.html)\n\n### Change the configuration\nIn file `TriplesExtractor/configuration/properties.py`, change the following\n\n*  `SYNTAXNET_DIRECTORY` \n    * Path to your `models/syntaxnet` directory on your local system\n*  `INPUT_TEXT_DIR`\n    * Path to the location of your inputfiles\n*  `TEMP_WORKING_DIR`\n    * A temporary working location where the files will be kept while processing\n*   `OUTPUT_TRIPLES_DIR`\n    * Output directory where for each input file a corresponding output file with triples will be present after execution.\n \nBelow is an example of my `TriplesExtractor/configuration/properties.py`\n```python\n    SYNTAXNET_DIRECTORY=\"/home/abhishek/models/syntaxnet\"\n    INPUT_TEXT_DIR=\"/home/abhishek/PycharmProjects/TriplesExtractor/inputfiles\"\n    TEMP_WORKING_DIR=\"/home/abhishek/PycharmProjects/TriplesExtractor/workingdir\"\n    DONE_DIR=\"/Users/abhsingh/PycharmProjects/TripleExtractor/donedir\"\n    OUTPUT_TRIPLES_DIR=\"/home/abhishek/PycharmProjects/TriplesExtractor/output\"\n```\n\n## Development\nRun the following python class to test if everything is correctly set.\nYou can place a test input file at the `INPUT_TEXT_DIR` location.\nif the program runs successfully and exits with 0, you will find the corresponding output file at `OUTPUT_TRIPLES_DIR`\n```bash\npython TriplesExtractor/TriplesOrchestrator/TriplesOrchestrator.py\n\n``` \n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Faosingh%2Ftriplesextractor","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Faosingh%2Ftriplesextractor","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Faosingh%2Ftriplesextractor/lists"}