{"id":19623872,"url":"https://github.com/sr1jan/topwords-fbchat","last_synced_at":"2026-04-18T15:35:50.128Z","repository":{"id":95678133,"uuid":"129754655","full_name":"sr1jan/topwords-fbchat","owner":"sr1jan","description":"Visualization of top words used in a Facebook and Instagram group chats","archived":false,"fork":false,"pushed_at":"2020-08-01T23:53:15.000Z","size":1740,"stargazers_count":2,"open_issues_count":0,"forks_count":0,"subscribers_count":0,"default_branch":"master","last_synced_at":"2025-06-12T21:14:15.523Z","etag":null,"topics":["d3js","data-visualization","python"],"latest_commit_sha":null,"homepage":"","language":"JavaScript","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/sr1jan.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2018-04-16T14:17:47.000Z","updated_at":"2021-11-01T22:04:15.000Z","dependencies_parsed_at":"2023-05-19T20:15:56.274Z","dependency_job_id":null,"html_url":"https://github.com/sr1jan/topwords-fbchat","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/sr1jan/topwords-fbchat","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/sr1jan%2Ftopwords-fbchat","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/sr1jan%2Ftopwords-fbchat/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/sr1jan%2Ftopwords-fbchat/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/sr1jan%2Ftopwords-fbchat/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/sr1jan","download_url":"https://codeload.github.com/sr1jan/topwords-fbchat/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/sr1jan%2Ftopwords-fbchat/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":31974802,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-04-18T00:39:45.007Z","status":"online","status_checked_at":"2026-04-18T02:00:07.018Z","response_time":103,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["d3js","data-visualization","python"],"created_at":"2024-11-11T11:35:54.501Z","updated_at":"2026-04-18T15:35:50.093Z","avatar_url":"https://github.com/sr1jan.png","language":"JavaScript","funding_links":[],"categories":[],"sub_categories":[],"readme":"# topwords-fbchat\n\n\u003cp align=\"center\"\u003e\nVisualization of top words used in Facebook and Instagram group chats\n\u003c/p\u003e\n\n\u003cp align='center'\u003e\n\u003cimg src=\"./wordsVISUAL_FB.PNG\" alt=\"Facebook group chat\" width=\"500\" height=\"400\" style=\"margin:2px \"/\u003e\n\u003cimg src=\"./wordsVISUAL_IG.PNG\" alt=\"Instagram group chat\" width=\"500\" height=\"400\" style=\"margin:2px\"/\u003e\n\u003c/p\u003e\n\n\u003e This is my first ever python project. Also my first github repository.\n\n### All you need to know about this project\n\nI am using my own Facebook and Instagram group chat data as the data set. The plan is to visualize the top words used in the chat using python and d3.js library of javascript.\n\n### Note\n\n\u003e Due to privacy issues I am not gonna provide my own data here. You can use your data to generate the same result. Remember to change the _.html_ filename in the **scrap_words.py** file. Code snippet:\n\n```python\nwith open('163.html', 'r', encoding='utf8') as file:\n  data = file.read()\n```\n\n\u003e Instead of _163.html_ you have to use your own filename\n\n### Steps\n\n1. scrap_words.py\n\n- Scraping and parsing data from a .html file using python beautifulsoup library\n- Cleaning the data to get words used in the chat\n- Storing words in a database using Sqlite3\n\n2. visualize_words.py\n\n- Retrieving data from the database\n- Calculating highest and lowest word frequency from top 100 words\n- Setting up font sizes for words according to their frequency\n- Saving words - frequency in gwords.js file.\n\nOnce you have completed the above steps you would have generated the **gword.js** file using that in the **gword.htm** you could easily visualize your data.\n\nTo better understand the visualiztion part you have to understand the workings of [d3.js library](https://github.com/d3/d3/wiki)\n\n\u003e If you have any questions or a project suggestion, you can reach me here: `srofficialsingh@gmail.com`\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fsr1jan%2Ftopwords-fbchat","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fsr1jan%2Ftopwords-fbchat","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fsr1jan%2Ftopwords-fbchat/lists"}