{"id":18412559,"url":"https://github.com/topagrume/nlp_project","last_synced_at":"2026-02-06T19:32:26.621Z","repository":{"id":240399079,"uuid":"790236493","full_name":"TopAgrume/NLP_Project","owner":"TopAgrume","description":"Poems classification and generation","archived":false,"fork":false,"pushed_at":"2024-08-02T07:50:27.000Z","size":3615,"stargazers_count":4,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-07-06T03:38:48.297Z","etag":null,"topics":["dataset","exploratory-analysis","nlp"],"latest_commit_sha":null,"homepage":"","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/TopAgrume.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-04-22T14:10:04.000Z","updated_at":"2025-07-03T13:48:18.000Z","dependencies_parsed_at":"2024-05-20T16:00:48.354Z","dependency_job_id":"3f724eca-e404-4d68-8255-db7e622d6046","html_url":"https://github.com/TopAgrume/NLP_Project","commit_stats":null,"previous_names":["topagrume/nlp1_project","topagrume/nlp_project"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/TopAgrume/NLP_Project","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/TopAgrume%2FNLP_Project","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/TopAgrume%2FNLP_Project/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/TopAgrume%2FNLP_Project/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/TopAgrume%2FNLP_Pro
ject/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/TopAgrume","download_url":"https://codeload.github.com/TopAgrume/NLP_Project/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/TopAgrume%2FNLP_Project/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":29174133,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-02-06T19:28:14.811Z","status":"ssl_error","status_checked_at":"2026-02-06T19:28:13.420Z","response_time":59,"last_error":"SSL_read: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["dataset","exploratory-analysis","nlp"],"created_at":"2024-11-06T03:42:38.140Z","updated_at":"2026-02-06T19:32:26.596Z","avatar_url":"https://github.com/TopAgrume.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# NLP_Project: Poem Classification and Generation\n\n## Project Overview\nThis project focuses on the classification and generation of poems, as well as web scraping to create our own dataset. The project is divided into several components, each utilizing different technologies and frameworks.\n\n## Datasets Used\n\n1. **First dataset for generation**: [Kaggle - Poetry Foundation Poems](https://www.kaggle.com/datasets/tgdivy/poetry-foundation-poems/data)\n2. 
**Second dataset for generation**: [Kaggle - Complete Poetryfoundationorg Dataset](https://www.kaggle.com/datasets/johnhallman/complete-poetryfoundationorg-dataset)\n3. **Kaggle dataset for generation**: [Kaggle - Poem Classification NLP](https://www.kaggle.com/datasets/ramjasmaurya/poem-classification-nlp)\n4. **Our first dataset for classification** (144 possible classes): [Kaggle - Poems Dataset NLP (topics part)](https://www.kaggle.com/datasets/michaelarman/poemsdataset?select=topics)\n5. **Creation of our own dataset for classification** (5 possible classes): [Kaggle - Poems Classification Dataset](https://www.kaggle.com/datasets/djdonpablo/poem-classification-dataset)\n6. **Poetry Foundation Terms of Service for Robots**: [Poetry Foundation Robots.txt](https://www.poetryfoundation.org/robots.txt)\n\nOur classification dataset was built by scraping the Poetry Foundation website. It covers five topics (nature, art \u0026 sciences, love, relationships, and religion), with poems fairly evenly distributed across them.\n\n**See**: [Kaggle Dataset](https://www.kaggle.com/datasets/djdonpablo/poem-classification-dataset)\n\n## Technologies and Frameworks Used\n\n```\nsrc\n├── classification\n│   ├── FNN\n│   ├── Logistic Regression \u0026 Naive Bayes\n│   ├── RNN / LSTM\n│   ├── Transformers\n│   └── XGBoost\n└── generation\n    ├── Ngram\n    ├── Transformers\n    └── RNN\n```\n\n## Project Results\n\n![images/results.png](images/results.png)\n\n## Poem Generation Examples\n\n![images/gpt2-examples.png](images/gpt2_examples.png)\n\n## Members\n\n- angelo.eap\n- valentin.san\n- christophe.nguyen\n- alexandre.devaux-riviere\n- paul.duhot\n- mael.reynaud\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Ftopagrume%2Fnlp_project","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Ftopagrume%2Fnlp_project","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Ftopagrume%2Fnlp_project/lists"}