{"id":31126663,"url":"https://github.com/lalitha2019/text-processing-with-unix","last_synced_at":"2025-09-17T23:01:57.401Z","repository":{"id":314831153,"uuid":"1056893991","full_name":"lalitha2019/text-processing-with-unix","owner":"lalitha2019","description":"Text processing with Unix utilities","archived":false,"fork":false,"pushed_at":"2025-09-15T03:03:05.000Z","size":8,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":0,"default_branch":"main","last_synced_at":"2025-09-15T04:32:01.128Z","etag":null,"topics":["awk","curl","sed","text-processing","unix"],"latest_commit_sha":null,"homepage":"","language":null,"has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/lalitha2019.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null,"notice":null,"maintainers":null,"copyright":null,"agents":null,"dco":null,"cla":null}},"created_at":"2025-09-15T01:58:52.000Z","updated_at":"2025-09-15T03:40:11.000Z","dependencies_parsed_at":"2025-09-15T04:32:02.485Z","dependency_job_id":"f37e2918-fe18-4e34-86bf-cd2cf5e75240","html_url":"https://github.com/lalitha2019/text-processing-with-unix","commit_stats":null,"previous_names":["lalitha2019/text-processing-with-unix"],"tags_count":null,"template":false,"template_full_name":null,"purl":"pkg:github/lalitha2019/text-processing-with-unix","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/lalitha2019%2Ftext-processing-with-unix","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/lalitha2019%2Ftext-processing-with-unix/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/lalitha2019%2Ftext-processing-with-unix/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/lalitha2019%2Ftext-processing-with-unix/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/lalitha2019","download_url":"https://codeload.github.com/lalitha2019/text-processing-with-unix/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/lalitha2019%2Ftext-processing-with-unix/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":275680446,"owners_count":25508570,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-09-17T02:00:09.119Z","response_time":84,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["awk","curl","sed","text-processing","unix"],"created_at":"2025-09-17T23:00:42.890Z","updated_at":"2025-09-17T23:01:57.372Z","avatar_url":"https://github.com/lalitha2019.png","language":null,"funding_links":[],"categories":[],"sub_categories":[],"readme":"There are unix commands like awk and sed that make text processing such an interesting exercise.\nIn the beginning of my career I work with unix tool lex and yacc\nIn recent years I had to use unix on and off\nRefreshing for my job search this time, I fell in love with awk and sed which I hadn't had the opportunity to use at work so far.\n\nAnother fascination for me in the pase few years has been to analyse patterns in wordle answers.\nOne such exercise using python:\nhttps://github.com/lalitha2019/wordle-analysis\n\nPutting together my 2 new found interests, I started working on a project using unix now.\nObjective:\nhttps://wordfinder.yourdictionary.com/wordle/answers/ - This website maintains an archive of wordle solutions and updates daily.\n. Get the content from there in html format and extract the solutions to a text file.\n. Process the text file and organize the data in suitable form for analysis\n. Perform Analysis\n\n\nFiles:\nunix code\n- get-wordle-answers-from-web \n\nscript used by awk command in the above unix code\n- script2.txt\n\nintermediate / output files\n- t.html - html file from the url\n- finally.html - intermediate output - the answers part of html file\n- cleanAnswers.txt - intermediate output - the answers part above, formatted into neat key, value pairs (and a few additional lines which are easy to clean up, if needed)\n  This file can be used as input for data anlalysis using python or any other preferred language / tool\n  Details in this text file can be read as data structures like arrays or tables and used for analysis and visualization\n- wordleData.txt - data from the above key, value pairs (has a few additional lines that may or may not be helpful in further processing / analysis)\n  The above file is good enough for using as input for data analysis, but this was just an extension to my exercise in text processing\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Flalitha2019%2Ftext-processing-with-unix","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Flalitha2019%2Ftext-processing-with-unix","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Flalitha2019%2Ftext-processing-with-unix/lists"}