{"id":15059279,"url":"https://github.com/fedora-infra/datagrepper","last_synced_at":"2025-05-12T14:56:16.070Z","repository":{"id":6469777,"uuid":"7709656","full_name":"fedora-infra/datagrepper","owner":"fedora-infra","description":"HTTP API for datanommer and the fedmsg bus","archived":false,"fork":false,"pushed_at":"2025-05-12T08:40:45.000Z","size":7689,"stargazers_count":43,"open_issues_count":38,"forks_count":37,"subscribers_count":11,"default_branch":"develop","last_synced_at":"2025-05-12T09:58:45.678Z","etag":null,"topics":["data","data-analysis","data-science","fedora","fedora-project","postgres","postgresql","python"],"latest_commit_sha":null,"homepage":"https://apps.fedoraproject.org/datagrepper/","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"gpl-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/fedora-infra.png","metadata":{"files":{"readme":"README.md","changelog":"news/708.dependency","contributing":null,"funding":null,"license":"COPYING","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2013-01-19T22:19:42.000Z","updated_at":"2025-04-14T02:52:45.000Z","dependencies_parsed_at":"2023-11-06T02:42:26.644Z","dependency_job_id":"2b698ee7-853e-4ccc-86a6-14b5dd32e6ef","html_url":"https://github.com/fedora-infra/datagrepper","commit_stats":{"total_commits":700,"total_committers":30,"mean_commits":"23.333333333333332","dds":0.7414285714285714,"last_synced_commit":"046cdc29f52d9f8e04762bcc6aaa1c54044e6de4"},"previous_names":[],"tags_count":28,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/fedora-infra%2Fdatagrepper","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/fedora-infra%2Fdatagrepper/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/fedora-infra%2Fdatagrepper/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/fedora-infra%2Fdatagrepper/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/fedora-infra","download_url":"https://codeload.github.com/fedora-infra/datagrepper/tar.gz/refs/heads/develop","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":253717586,"owners_count":21952514,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["data","data-analysis","data-science","fedora","fedora-project","postgres","postgresql","python"],"created_at":"2024-09-24T22:40:29.383Z","updated_at":"2025-05-12T14:56:16.039Z","avatar_url":"https://github.com/fedora-infra.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# datagrepper\n\nDatagrepper is a web application and JSON API to retrieve historical messages sent via Fedora Messaging. [Datanommer](https://github.com/fedora-infra/datanommer/) is a seperate project and service that consumes messages from the Fedora Messaging queue and puts them in a database. These messages is what datagrepper queries. \n\nDatagrepper is curently running in production at https://apps.fedoraproject.org/datagrepper/\n\n## Development Environment\n\nVagrant allows contributors to get quickly up and running with a datagrepper development environment by automatically configuring a virtual machine. \n\nThe datagrepper Vagrant environment configures configures and enables a datanommer service and database. The datanommer instance is configured to be empty when first provisioned, but to consume messages from the stage Fedora Messaging queue.\n\n### Install vagrant\nTo get started, run the following commands to install the Vagrant and Virtualization packages needed, and start the libvirt service:\n\n    $ sudo dnf install ansible libvirt vagrant-libvirt vagrant-sshfs vagrant-hostmanager\n    $ sudo systemctl enable libvirtd\n    $ sudo systemctl start libvirtd\n\n### Checkout and Provision\nNext, check out the datagrepper code and run vagrant up:\n\n    $ git clone https://github.com/fedora-infra/datagrepper\n    $ cd datanommer\n    $ vagrant up\n\n### Interacting with your development datagrepper\nAfter successful provisioning of the Datagrepper vagrant setup, the datagrepper web application will be accessible from your host machine's web browser at\n\nhttp://datagrepper.test:5000/\n\n\n\n\n### Using the development environment\nSSH into your newly provisioned development environment:\n\n    $ vagrant ssh\n\nThe vagrant setup also defines 4 handy commands to interact with the service that runs the datagrepper flask application: \n\n    $ datagrepper-start\n    $ datagrepper-stop\n    $ datagrepper-restart\n    $ dataprepper-logs\n\nAdditionally, the following commands are also available for interacting with the datanommer service:\n\n    $ datanommer-consumer-start\n    $ datanommer-consumer-stop\n    $ datanommer-consumer-restart\n    $ datanommer-consumer-logs\n\n### Running the tests\nDatanommer is comprised of 3 seperate modules in this single repository. There is a handy script in the top directory of this repo to run the tests on all 3 modules:\n\n    $ ./runtests.sh\n\nHowever, tests can also be run on a single module by invotking tox in that modules' directory. For example:\n\n    $ cd datanommer.models/\n    $ tox\n\nNote, that the tests use virtual environments that are not created from scratch with every subsequent run of the tests. Therefore, **when changes happen to dependencies, the tests may fail to run correctly**. To recreate the virtual envrionments,  run the tests commands with the `-r` flag, for example:\n\n    $ ./runtests.sh -r\n\nor\n\n    $ cd datanommer.models/\n    $ tox -r\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Ffedora-infra%2Fdatagrepper","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Ffedora-infra%2Fdatagrepper","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Ffedora-infra%2Fdatagrepper/lists"}