{"id":22895768,"url":"https://github.com/pseudogene/ora","last_synced_at":"2025-08-12T02:32:29.855Z","repository":{"id":79848638,"uuid":"61105729","full_name":"pseudogene/ora","owner":"pseudogene","description":"Olfactory Receptor family Assigner (bioperl module)","archived":false,"fork":false,"pushed_at":"2016-06-16T08:58:18.000Z","size":17526,"stargazers_count":4,"open_issues_count":2,"forks_count":3,"subscribers_count":3,"default_branch":"master","last_synced_at":"2023-10-20T22:40:37.652Z","etag":null,"topics":["bats","bioinformatics","bioperl","olfactor","ora","receptor","scientific"],"latest_commit_sha":null,"homepage":null,"language":"Perl","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"gpl-3.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/pseudogene.png","metadata":{"files":{"readme":"README.md","changelog":"CHANGES.md","contributing":null,"funding":null,"license":"LICENSE.md","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null}},"created_at":"2016-06-14T08:30:33.000Z","updated_at":"2023-10-20T22:40:37.786Z","dependencies_parsed_at":null,"dependency_job_id":"87225e26-be6b-4164-a8b5-ff6ea41c0261","html_url":"https://github.com/pseudogene/ora","commit_stats":null,"previous_names":[],"tags_count":1,"template":null,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/pseudogene%2Fora","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/pseudogene%2Fora/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/pseudogene%2Fora/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/pseudogene%2Fora/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/pseudogene","download_url":"https://codeload.github.com/pseudogene/ora/tar.gz/refs/heads/master
","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":229629256,"owners_count":18101263,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["bats","bioinformatics","bioperl","olfactor","ora","receptor","scientific"],"created_at":"2024-12-13T23:31:54.256Z","updated_at":"2024-12-13T23:31:54.768Z","avatar_url":"https://github.com/pseudogene.png","language":"Perl","funding_links":[],"categories":[],"sub_categories":[],"readme":"[![Build Status](https://travis-ci.org/pseudogene/ora.svg?branch=master)](https://travis-ci.org/pseudogene/ora)\n\n# Bio::ORA\n\nOlfactory Receptor family Assigner (ORA) [bioperl module].\n\n## Description\nBio::ORA is a featherweight object for identifying mammalian olfactory receptor genes. The sequences should not be longer than 40kb. The returned array include location, sequence and statistic for the putative olfactory receptor gene. Fully functional with DNA and EST sequence, no intron supported.\n\n## Associated publication\n\n\u003e**A cluster of olfactory receptor genes linked to frugivory in bats**\n\u003eHayden S, Bekaert M, Goodbla A, Murphy WJ, Dávalos LM, Teeling EC.\n\u003e_Mol Biol Evol_. 2014 Apr;31(4):917-27.\n\n[![DOI](https://img.shields.io/badge/DOI-10.1093%2Fmolbev%2Fmsu043-blue.svg)](http://dx.doi.org/10.1093/molbev/msu043)\n\n## How to use this repository?\n\nThis repository host both the scripts and tools developed by this study. Feel free to adapt the scripts and tools, but remember to cite their authors!\n\nTo look at our scripts and raw results, **browse** through this repository. 
If you want to reproduce our results you will need to **clone** this repository, build the Docker image, and then run all the scripts. If you want to use our data for your own research, **fork** this repository and **cite** the authors.\n\n\n## Requirements\nTo use this module you may need:\n\n *  [Bioperl](http://bioperl.org/) modules,\n *  [HMMER v3+](http://hmmer.org/) distribution and\n *  [FASTA 36+](https://github.com/wrpearson/fasta36) distribution.\n\n\n## Installation\n\nYou can install the Bio::ORA module directly via [CPAN](http://search.cpan.org/~ceratites/ora/) or via [GitHub](https://github.com/pseudogene/ora):\n\n#### CPAN (easiest way)\n\n```\nperl -MCPAN -e 'install Bio::ORA'\n```\n\n\n#### GitHub (most recent version)\n\n```\ngit clone https://github.com/pseudogene/ora.git\ncd ora\nperl Makefile.pl\nmake\nmake test\nsudo make install\n```\n\n## Usage\n\n```\n..:: Olfactory Receptor Assigner (ORA) ::..\n\nUsage: or.pl [-options] --sequence=\u003csequence fasta file\u003e\n\nOptions\n -a\n    Force the use of alternative start codons, according to the current genetic code.\n    Otherwise, ATG is the only initiation codon allow.\n --expect\n    Set the E-value threshold. This setting specifies the statistical significance\n    threshold for reporting matches against database sequences. [Default 1e-10].\n --format\n    Available output format [Default fasta]:\n      fasta (FASTA format)\n      csv (Comma-separated values)\n      genbank (GenBank format)\n      tsv (Direct output for R)\n      tbl (GenBank TBL format)\n -c\n    When using a large number of contigs (e.g. newly sequenced genome), proceed to an\n    initial FASTA search to identify the contigs where to run the actual ORA search.\n --filter\n    Show ONLY the selected family number.\n --sub\n    Extract the sequences of the Fasta hits.\n --name\n    Overwrite the sequence name by the provided one. 
Otherwise the program will use the\n    sequence name from as input.\n --table\n    Force a genetic code to be used for the translation of the query. [Default 1]\n --size\n    Filter fragments over the specified size as functional.\n -d\n    Print all sequence details.\n\nAdvance options\n --resume\n    Resume the search from given sequence name.\n --hmmfile\n    Provide alternative HMM profiles.\n    [Default /root/or.hmm]\n --fastafile\n    Provide alternative reference OR sequences (fasta format).\n    [Default /root/or.fasta]\n -v\n    Print more possibly useful stuff, such as the individual scores for each sequence.\n```\n\n\n## Synopsis\n\nTake a sequence object from, say, an input stream, and find an olfactory receptor gene. HMM profiles are used to identify the location, frame and orientation of the gene.\n\nCreating the ORA object, _e.g._:\n\n```\nmy $inputstream = Bio::SeqIO-\u003enew( -file =\u003e 'seqfile', -format =\u003e 'fasta' );\nmy $seqobj = $inputstream-\u003enext_seq();\nmy $ORA_obj = Bio::ORA-\u003enew( $seqobj );\n```\n\nObtain a reference to an array holding the start point, the stop point, the DNA sequence and the amino-acid sequence, _e.g._:\n\n```\nmy $array_ref = $ORA_obj-\u003efind() ? $ORA_obj-\u003e{'_result'} : undef;\n```\n\nDisplay the result in GenBank format, _e.g._:\n\n```\n$ORA_obj-\u003eshow( 'genbank' );\n```\n\n### Driver script\n\n```\n#!/usr/bin/perl\nuse strict;\nuse warnings;\nuse Bio::SeqIO;\nuse Bio::ORA;\n\nmy $inseq = Bio::SeqIO-\u003enew( '-file' =\u003e q{\u003c} . $ARGV[0], -format =\u003e 'fasta' );\nwhile (my $seq = $inseq-\u003enext_seq) {\n  my $ORA_obj = Bio::ORA-\u003enew( $seq );\n  if ( $ORA_obj-\u003efind() ) {\n    $ORA_obj-\u003eshow( 'genbank' );\n  } else {\n    print \"  no hit!\\n\";\n  }\n}\n```\n\n### Local adaptations\nThis module relies on three external programs. If HMMER or FASTA is updated, make sure that HMMER's hmmscan and FASTA's tfastx36 and fastx36 still exist under the same names. 
You can change the calls by editing the \"Default softwares\" section of `or.pm`.\n\n```\n# Default softwares\nmy $hmmscan = 'hmmscan';\nmy $tfastx = 'tfastx36';\nmy $fastx = 'fastx36';\n```\n\nSimilarly, an update of HMMER may require rebuilding the HMM indexes. Run `hmmpress`:\n\n```\nhmmpress -f /usr/local/bin/or.hmm\n```\n\n## Issues\n\nIf you have any problems with or questions about the scripts, please contact us through a [GitHub issue](https://github.com/pseudogene/ora/issues).\nAny issue related to the scientific results themselves must be raised directly with the authors.\n\n\n## Contributing\n\nYou are invited to contribute new features, fixes, or updates, large or small; we are always thrilled to receive pull requests, and do our best to process them as fast as we can.\n\n\n## License and distribution\n\nThis code is distributed under the GNU [GPLv3 license](http://www.gnu.org/licenses/gpl-3.0.html). The documentation, raw data and work are licensed under a [Creative Commons Attribution-ShareAlike 4.0 International License](http://creativecommons.org/licenses/by-sa/4.0/).\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fpseudogene%2Fora","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fpseudogene%2Fora","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fpseudogene%2Fora/lists"}