{"id":20763700,"url":"https://github.com/infosys/berguig","last_synced_at":"2025-07-16T14:39:06.343Z","repository":{"id":97110332,"uuid":"250236615","full_name":"Infosys/Berguig","owner":"Infosys","description":"Berguig is an application which delivers personalized news feed, tweets, youtube video links and articles published at Gartner, based on the interests specified by the user. The application gives score to these items to highlight their relevance with the help of Machine Learning models.","archived":false,"fork":false,"pushed_at":"2021-01-12T11:14:18.000Z","size":89,"stargazers_count":0,"open_issues_count":0,"forks_count":3,"subscribers_count":3,"default_branch":"master","last_synced_at":"2025-01-18T05:46:36.877Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/Infosys.png","metadata":{"files":{"readme":"Readme.md","changelog":null,"contributing":null,"funding":null,"license":"License.txt","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2020-03-26T11:12:52.000Z","updated_at":"2021-01-12T11:14:21.000Z","dependencies_parsed_at":null,"dependency_job_id":"0a45b1a5-8b9f-41ed-9bc8-ab96b93afbf0","html_url":"https://github.com/Infosys/Berguig","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Infosys%2FBerguig","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Infosys%2FBerguig/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Infosys%2FBerguig/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Infosys%2FBerguig/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/Infosys","download_url":"https://codeload.github.com/Infosys/Berguig/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":243086246,"owners_count":20234004,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-11-17T10:45:35.966Z","updated_at":"2025-03-11T18:03:37.641Z","avatar_url":"https://github.com/Infosys.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"Berguig Application-Python\r\n==========================\r\n\r\nBerguig is a python application to deliver personalized news feed, tweets, youtube video links and gartner articles based on the company interests specified by the user. The list of companies that interest the user are stored in XML files, which are then read by the Berguig program.\r\n\r\nExample of XML file\r\n-------------------\r\n\r\npeople.xml\r\n----------\r\nContains the person's name, mail id, groups of comapnies or individual companiees of interest\r\n\r\ncomapnies.xml\r\n-------------\r\n\r\nThe information about the companies like twitter handle, youtube channel is stored in companies.xml as below\r\n\r\n\u003ccompanies_list\u003e\r\n\u003ccompany\u003e\r\n    \u003cc_id\u003ec1\u003c/c_id\u003e\r\n    \u003cc_name\u003ecompany_name\u003c/c_name\u003e\r\n    \u003cyoutube\u003echannel_name\u003c/youtube\u003e\r\n    \u003ctwitter_name\u003e@twitter_account\u003c/twitter_name\u003e\r\n    \u003chashtag\u003e#hashtag\u003c/hashtag\u003e\r\n\t\u003ckeyword\u003ekeyword1\u003c/keyword\u003e\r\n\t\u003ckeyword\u003ekeyword2\u003c/keyword\u003e\r\n\u003c/company\u003e\r\n\u003ccompany\u003e\r\n    \u003cc_id\u003ec1\u003c/c_id\u003e\r\n    \u003cc_name\u003ecompany_name\u003c/c_name\u003e\r\n    \u003cyoutube\u003echannel_name\u003c/youtube\u003e\r\n    \u003ctwitter_name\u003e@twitter_account\u003c/twitter_name\u003e\r\n    \u003chashtag\u003e#hashtag\u003c/hashtag\u003e\r\n\t\u003ckeyword\u003ekeyword1\u003c/keyword\u003e\r\n\t\u003ckeyword\u003ekeyword2\u003c/keyword\u003e\r\n\u003c/company\u003e\r\n\u003c/companies_list\u003e\r\n\r\nkeywords_list.xml\r\n-----------------\r\n\r\ncontains a list of keywords that the application will use to filter the news articles from google news API.\r\n\r\nlog.xml\r\n-----------------\r\n\r\nContains the last executed date of the program.\r\n\r\nAlgo_4\r\n------------------\r\n\r\nThe Algo_4 is used to train the machine learning model. The input for this python program is the scoring_v3.xlsx file.\r\nThe user has to score the articles generated by the application to train it to classify between relevant and non-relevant news.\r\n\r\nScoring_v3.xlsx\r\n------------------\r\n\r\nThis xlsx file is used to train the machine learning model. The user has to score the articles generated by the application and store in this xlsx in the format shown below:\r\n\r\nCategory\tCompany\tSource/User\tKeywords\tSubjects/Views\tTitle/Tweet\tDate\tLink\tUseful\tRelevance\tRelevance_fig\r\n\t\t\t\t\t\t\t\t\t\t\r\nThe possible values of useful are \r\n\r\n0,1,2 : Not Useful\r\n\r\n3,4,5 : Useful\r\n\r\nRelevance values based on useful values:\r\n\r\n0,1,2 : Relevant\r\n\r\n3,4,5 : Irrelevant\r\n\r\nRelevance_fig based in the Relevance values:\r\n\r\nRelevant : 1\r\n\r\nIrrelevant : 0\r\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Finfosys%2Fberguig","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Finfosys%2Fberguig","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Finfosys%2Fberguig/lists"}