{"id":23057253,"url":"https://github.com/applicativesystem/tair-pubmed-connect","last_synced_at":"2025-08-15T06:32:20.918Z","repository":{"id":262142685,"uuid":"845684046","full_name":"applicativesystem/tair-pubmed-connect","owner":"applicativesystem","description":"tair id to pubmed links for articls, genes, annotations, locus tag","archived":false,"fork":false,"pushed_at":"2024-08-21T19:45:00.000Z","size":3,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2024-11-10T20:18:19.229Z","etag":null,"topics":["genome-analysis","genome-annotation","literature-modelling","ncbi","ncbi-database","ncbi-sra","pubmed","pubmed-literature","tair","tair3","topic-analysis"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/applicativesystem.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-08-21T18:13:59.000Z","updated_at":"2024-09-11T14:00:53.000Z","dependencies_parsed_at":"2024-11-10T20:18:59.044Z","dependency_job_id":"ebea1d50-0a13-48c5-987b-0da5869d3d24","html_url":"https://github.com/applicativesystem/tair-pubmed-connect","commit_stats":null,"previous_names":["applicativesystem/tair-pubmed-connect"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/applicativesystem%2Ftair-pubmed-connect","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/applicativesystem%2Ftair-pubmed-connect/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/applicativesystem%2Ftair-pubmed-connect/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/applicativesystem%2Ftair-pubmed-connect/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/applicativesystem","download_url":"https://codeload.github.com/applicativesystem/tair-pubmed-connect/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":229897445,"owners_count":18141135,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["genome-analysis","genome-annotation","literature-modelling","ncbi","ncbi-database","ncbi-sra","pubmed","pubmed-literature","tair","tair3","topic-analysis"],"created_at":"2024-12-16T02:11:43.846Z","updated_at":"2024-12-16T02:11:44.260Z","avatar_url":"https://github.com/applicativesystem.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# tair-pubmed-connector\n\n- There is no function to fetch automatically the information on the reported pubmed articles links in the tair to be used for the language models, so i coded this function which will take the tair information, a gene or locus tag and will fetch the corresponding pubmed and then from the pubmed the corresponding abstracts, which can be directly used for the text summarization into langchain or the llm models.\n\n```\npython\nreadTairNCBI(\"/Users/gauravsablok/Desktop/CodeCheck/release/ATH_GO_GOSLIM.txt\", \\\n                  \"/Users/gauravsablok/Desktop/CodeCheck/release/ATH_GO_GOSLIM.out\", \\\n                          \"2200935\")\n# this is just a reference id for checking the functionality\n[('https://pubmed.ncbi.nlm.nih.gov/30356219/',\n  'Nitrogen is an essential macronutrient for plant growth and basic metabolic processes. \n  The application of nitrogen-containing fertilizer increases yield, which has been a \n  substantial factor in the green revolution1. Ecologically, however, excessive application \n  of fertilizer has disastrous effects such as eutrophication2. A better understanding \n  of how plants regulate nitrogen metabolism is critical to increase plant yield and reduce \n  fertilizer overuse. Here we present a transcriptional regulatory network and twenty-one \n  transcription factors that regulate the architecture of root and shoot systems in response \n  to changes in nitrogen availability. Genetic perturbation of a subset of these transcription \n  factors revealed coordinate transcriptional regulation of enzymes involved in nitrogen metabolism. \n  Transcriptional regulators in the network are transcriptionally modified by feedback via \n  genetic perturbation of nitrogen metabolism. The network, genes and gene-regulatory modules \n  identified here will prove critical to increasing agricultural productivity.'),\n ('https://pubmed.ncbi.nlm.nih.gov/34562334/',\n  'Unraveling gene function is pivotal to understanding the signaling cascades that control plant \n  development and stress responses. As experimental profiling is costly and labor intensive, \n  there is a clear need for high-confidence computational annotation. In contrast to detailed \n  gene-specific functional information, transcriptomics data are widely available for both model and \n  crop species. Here, we describe a novel automated function prediction method, which leverages \n  complementary information from multiple expression datasets by analyzing study-specific gene \n  co-expression networks. First, we benchmarked the prediction performance on recently characterized \n  Arabidopsis thaliana genes, and showed that our method outperforms state-of-the-art expression-based \n  approaches. Next, we predicted biological process annotations for known (n = 15 790) and unknown \n  (n = 11 865) genes in A. thaliana and validated our predictions using experimental protein-DNA and \n  protein-protein interaction data (covering \u003e220 000 interactions in total), obtaining a set of \n  high-confidence functional annotations. Our method assigned at least one validated annotation to \n  5054 (42.6%) unknown genes, and at least one novel validated function to 3408 (53.0%) genes with \n  computational annotations only. These omics-supported functional annotations shed light on a \n  variety of developmental processes and molecular responses, such as flower and root development, \n  defense responses to fungi and bacteria, and phytohormone signaling, and help fill the information \n  gap on biological process annotations in Arabidopsis. An in-depth analysis of two context-specific \n  networks, modeling seed development and response to water deprivation, shows how previously \n  uncharacterized genes function within the respective networks. Moreover, our automated function \n  prediction approach can be applied in future studies to facilitate gene discovery for crop improvement.'),\n ('https://pubmed.ncbi.nlm.nih.gov/11118137/',\n  'The completion of the Arabidopsis thaliana genome sequence allows a comparative analysis of \n  transcriptional regulators across the three eukaryotic kingdoms. Arabidopsis dedicates over \n  5% of its genome to code for more than 1500 transcription factors, about 45% of which are from \n  families specific to plants. Arabidopsis transcription factors that belong to families common \n  to all eukaryotes do not share significant similarity with those of the other kingdoms beyond the \n  conserved DNA binding domains, many of which have been arranged in combinations specific to each \n  lineage. The genome-wide comparison reveals the evolutionary generation of diversity in the \n  regulation of transcription.')]\n```\n\nGaurav Sablok \\\nUniversity of Potsdam \\\nPotsdam,Germany\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fapplicativesystem%2Ftair-pubmed-connect","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fapplicativesystem%2Ftair-pubmed-connect","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fapplicativesystem%2Ftair-pubmed-connect/lists"}