{"id":17278253,"url":"https://github.com/saketkc/moca","last_synced_at":"2025-04-12T20:14:49.477Z","repository":{"id":57442566,"uuid":"50889586","full_name":"saketkc/moca","owner":"saketkc","description":":m: Tool for motif conservation analysis","archived":false,"fork":false,"pushed_at":"2022-06-23T19:58:45.000Z","size":33041,"stargazers_count":9,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"master","last_synced_at":"2025-04-12T20:14:10.547Z","etag":null,"topics":["bioinformatics","chip-seq","motif-discovery","transcription-factor-binding","transcription-factors"],"latest_commit_sha":null,"homepage":"http://saket-choudhary.me/moca","language":"PostScript","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"isc","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/saketkc.png","metadata":{"files":{"readme":"README.rst","changelog":"HISTORY.rst","contributing":"CONTRIBUTING.rst","funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2016-02-02T02:57:58.000Z","updated_at":"2022-06-23T19:58:49.000Z","dependencies_parsed_at":"2022-09-26T17:21:11.611Z","dependency_job_id":null,"html_url":"https://github.com/saketkc/moca","commit_stats":null,"previous_names":[],"tags_count":8,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/saketkc%2Fmoca","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/saketkc%2Fmoca/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/saketkc%2Fmoca/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/saketkc%2Fmoca/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/saketkc","download_url":"https://codeload.github.com/saketkc/moca/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":248625517,"owners_count":21135514,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["bioinformatics","chip-seq","motif-discovery","transcription-factor-binding","transcription-factors"],"created_at":"2024-10-15T09:11:11.518Z","updated_at":"2025-04-12T20:14:49.423Z","avatar_url":"https://github.com/saketkc.png","language":"PostScript","funding_links":[],"categories":[],"sub_categories":[],"readme":"==========================================\nMoCA: Tool for MOtif Conservation Analysis\n==========================================\n\n.. image:: https://img.shields.io/pypi/v/moca.svg\n        :target: https://pypi.python.org/pypi/moca/\n\n.. image:: https://img.shields.io/travis/saketkc/moca.svg\n        :target: https://travis-ci.org/saketkc/moca\n        \n.. image:: https://img.shields.io/badge/install%20with-bioconda-brightgreen.svg\n        :target: https://bioconda.github.io/recipes/moca/README.html\n\n.. image:: https://coveralls.io/repos/github/saketkc/moca/badge.svg?branch=master\n        :target: https://coveralls.io/github/saketkc/moca?branch=master\n        \n.. image:: https://zenodo.org/badge/50889586.svg\n        :target: https://zenodo.org/badge/latestdoi/50889586\n   \nLICENSE\n-------\nISC\n\n\n\nInstallation\n------------\n\n\nRequirements\n~~~~~~~~~~~~\n\n* pybedtools\n* biopython\n* pandas\n* scipy\n* statsmodels\n* pybigwig\n* seaborn\n* MEME==4.10.2\n\nNOTE: MoCA also relies on `fasta-shuffle-letters` that was introduced in MEME `4.11.0`\nhence if you are using `4.10.2` make sure the `fasta-shuffle-letters` is the updated one.\n\nFor a sample script see `travis/install_meme.sh`\n\nUsing Conda\n~~~~~~~~~~~\n``moca`` is most compatible with the `conda`_ environment.\n\n::\n\n    $ conda config --add channels bioconda\n    $ conda install moca\n\n\nUsing pip\n~~~~~~~~~\n\n::\n\n   $ pip install moca\n\n\nFor development\n~~~~~~~~~~~~~~~\n\n::\n\n    $ git clone https://github.com:saketkc/moca.git\n    $ cd moca\n    $ conda env create -f environment.yml python=2.7\n    $ source activate mocadev\n    $ python setup.py install\n\n\n\nWorkflow\n--------\n\nMoCA makes use of PhyloP/PhastCons/GERP scores to assess the quality of a\nmotif, the hypothesis being a 'true motif' would evolve slower as compared\nto its surrounding(flanking sequences).\n\n.. image:: https://raw.githubusercontent.com/saketkc/moca_web/master/docs/abstract/workflow.png\n\n\nUsage\n-----\n\n::\n\n    $ moca\n    Usage: moca [OPTIONS] COMMAND [ARGS]...\n\n      moca: Motif Conservation Analysis\n\n    Options:\n      --version  Show the version and exit.\n      --help     Show this message and exit.\n\n    Commands:\n      find_motifs  Run meme to locate motifs and create...\n      plot         Create stacked conservation plots\n\n\n\nMotif analysis using MEME\n~~~~~~~~~~~~~~~~~~~~~~~~~\n\nMoCA can perform motif analysis for you given a bedfile containing\nChIP-Seq peaks.\n\nGenome builds and MEME binary locations are specified through a configuraton file.\nA sample configuration file is available: `tests/data/application.cfg` and should be\nself-explanatory.\n\nmoca find_motifs\n~~~~~~~~~~~~~~~~\n\n\n::\n\n    $ moca find_motifs -h\n    Usage: moca find_motifs [OPTIONS]\n\n      Run meme to locate motifs and create conservation stacked plots\n\n    Options:\n      -i, --bedfile TEXT            Bed file input  [required]\n      -o, --oc TEXT                 Output Directory  [required]\n      -c, --configuration TEXT      Configuration file  [required]\n      --slop-length INTEGER         Flanking sequence length  [required]\n      --flank-motif INTEGER         Length of sequence flanking motif  [required]\n      --n-motif INTEGER             Number of motifs\n      -t, --cores INTEGER           Number of parallel MEME jobs  [required]\n      -g, -gb, --genome-build TEXT  Key denoting genome build to use in\n                                    configuration file  [required]\n      --show-progress               Print progress\n      -h, --help                    Show this message and exit.\n\n\nmoca plot\n~~~~~~~~~\n\n\n::\n\n    $ moca plot -h\n    Usage: moca plot [OPTIONS]\n\n      Create stacked conservation plots\n\n    Options:\n      --meme-dir, --meme_dir TEXT     MEME output directory  [required]\n      --centrimo-dir, --centrimo_dir TEXT\n                                      Centrimo output directory  [required]\n      --fimo-dir-sample, --fimo_dir_sample TEXT\n                                      Sample fimo.txt  [required]\n      --fimo-dir-control, --fimo_dir_control TEXT\n                                      Control fimo.txt  [required]\n      --name TEXT                     Plot title\n      --flank-motif INTEGER           Length of sequence flanking motif\n                                      [required]\n      --motif INTEGER                 Motif number\n      -o, --oc TEXT                   Output Directory  [required]\n      -c, --configuration TEXT        Configuration file  [required]\n      --show-progress                 Print progress\n      -g, -gb, --genome-build TEXT    Key denoting genome build to use in\n                                      configuration file  [required]\n      -h, --help                      Show this message and exit.\n\n\nExample\n-------\n\nMost users will require using the command line version only:\n\n::\n\n    $ moca find_motifs -i encode_test_data/ENCFF002DAR.bed\\\n        -c tests/data/application.cfg -g hg19 --show-progress\n\n\n\nCreating plots if you already have run MEME and Centrimo:\n\n::\n\n    $ moca plot -c tests/data/application.cfg -g hg19\\\n        --meme-dir moca_output/meme_out\\\n        --centrimo-dir moca_output/centrimo_out\\\n        --fimo-dir-sample moca_output/meme_out/fimo_out_1\\\n        --fimo-dir-control moca_output/meme_out/fimo_random_1\\\n        --name ENCODEID\n\n\n.. image:: http://www.saket-choudhary.me/moca/_static/img/ENCFF002CEL.png\n\n\nThere is also a structured API available,\nhowever it might be missing examples and documentation at places.\n\nAPI Documentation\n-----------------\n\nhttp://saketkc.github.io/moca/\n\n\n\nTests\n-----\n``moca`` is mostly extensively tested. See `code-coverage`_. \n\nRun tests locally\n\n::\n\n    $ ./runtests.sh\n\n\nCredits\n---------\n\nThis package was created with Cookiecutter_ and the `audreyr/cookiecutter-pypackage`_ project template.\n\n.. _`MoCA0.1.0`: https://github.com/saketkc/moca_web\n.. _Cookiecutter: https://github.com/audreyr/cookiecutter\n.. _`audreyr/cookiecutter-pypackage`: https://github.com/audreyr/cookiecutter-pypackage\n.. _`conda`: http://conda.pydata.org/docs/using/using.html\n.. _`code-coverage`: https://coveralls.io/github/saketkc/moca?branch=master\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fsaketkc%2Fmoca","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fsaketkc%2Fmoca","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fsaketkc%2Fmoca/lists"}