{"id":25167147,"url":"https://github.com/biocomputingup/strpsearch","last_synced_at":"2025-06-26T07:05:07.343Z","repository":{"id":219745265,"uuid":"713885373","full_name":"BioComputingUP/STRPsearch","owner":"BioComputingUP","description":null,"archived":false,"fork":false,"pushed_at":"2025-05-20T21:11:57.000Z","size":141383,"stargazers_count":6,"open_issues_count":2,"forks_count":0,"subscribers_count":3,"default_branch":"master","last_synced_at":"2025-05-20T22:38:59.498Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"gpl-3.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/BioComputingUP.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2023-11-03T12:52:26.000Z","updated_at":"2025-02-12T15:15:12.000Z","dependencies_parsed_at":"2025-05-20T22:52:09.431Z","dependency_job_id":null,"html_url":"https://github.com/BioComputingUP/STRPsearch","commit_stats":null,"previous_names":["biocomputingup/strpsearch"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/BioComputingUP/STRPsearch","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/BioComputingUP%2FSTRPsearch","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/BioComputingUP%2FSTRPsearch/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/BioComputingUP%2FSTRPsearch/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/BioComputingUP%2FSTRPsearch/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/BioComputingUP","download_url":"https://codeload.github.com/BioComputingUP/STRPsearch/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/BioComputingUP%2FSTRPsearch/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":262018765,"owners_count":23245621,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2025-02-09T06:19:43.720Z","updated_at":"2025-06-26T07:05:07.325Z","avatar_url":"https://github.com/BioComputingUP.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# STRPsearch\nSTRPsearch is a specialized tool designed for rapid and precise identification and mapping of structured tandem repeats in proteins (STRPs).\n\nIf you find STRPsearch useful for your research, please cite:\n\nMozaffari S, Arrías PN, Clementel D, Piovesan D, Ferrari C, Tosatto SCE, Monzon AM. STRPsearch: fast detection of structured tandem repeat proteins. Bioinformatics. 2024;40(12):btae690. https://doi.org/10.1093/bioinformatics/btae690.\n\n## Getting Started\n\nTo get started with the project, first, extract the contents of `data/databases.zip` by running the following command: \n```\ncd data \u0026\u0026 unzip databases.zip \u0026\u0026 cd ..\n```\nThen you can choose one of the following methods to set up the software:\n\n### Method 1: Using requirements.txt\n\n1. Install all the dependencies listed in the `requirements.txt` file:\n```\npip install -r requirements.txt\n```\nNote: Inside the requirements.txt file, you'll find a commented section that includes dependencies which cannot be installed with pip. To install these dependencies, you can use Conda by running the following commands:\n```\nconda install -c conda-forge -c bioconda foldseek\nconda install -c bioconda tmalign\n```\n2. Navigate to the main directory of the project and run the software with the following command:\n```\npython3 ./bin/strpsearch.py [OPTIONS] COMMAND [ARGS]...\n```\n\n### Method 2: Using Conda Environment\n1. Import and activate the Conda environment from the `environment.yml` file:\n```\nconda env create -f environment.yml\nconda activate strpsearch_env\n```\n2. Navigate to the main directory of the project and run the software with the following command:\n```\npython3 ./bin/strpsearch.py [OPTIONS] COMMAND [ARGS]...\n```\n\n### Method 3: Using Docker\n1. Build the Docker image using the provided `Dockerfile`:\n```\ndocker build -t strpsearch .\n```\n2. To run the container in an interactive mode, use the following command:\n```\ndocker run -it --entrypoint /bin/bash -v /mount/directory/:/app strpsearch\n```\nBe aware that `-v /mount/directory/:/app` command mounts the specified directory (`/mount/directory/`) to the working directory of the container. This ables the container to read and write files on the host machine.\n\n3. Navigate to the main directory of the project and run the software with the following command:\n```\npython3 ./bin/strpsearch.py [OPTIONS] COMMAND [ARGS]...\n```\n\n## Usage:\nThe tools has three Commands, each with its positional arguments and options. \n\nTo list the available commands run:\n\n```python3 bin/strpsearch.py --help```\n\nWhich returns the following commands:\n\n| Command | Description |\n|---------|-------------|\n| `query-file` | Query an existing PDB/CIF formatted structure file by providing the file path |\n| `download-pdb` | Download and query a structure from PDB by providing the PDB ID and the specific Chain of interest |\n| `download-model` |  Download and query an AlphaFold model by providing the UniProt ID and the AlphaFold version of interest |\n| `version` | Show the version and exit | \n\n## query-file\n\n### Arguments\n* `input_file` (TEXT):  Path to the input structure file to query (PDB/mmCIF). This argument is required. Default: None\n* `out_dir` (TEXT): Path to the output directory. This argument is required. Default: None\n\n### Options\n* `--chain` (TEXT): Specific chain to query from the structures. Default: all\n* `--temp-dir` (TEXT): Path to the temporary directory. Default: /tmp\n* `--max-eval` (FLOAT): Maximum E-value of the targets to prefilter. Default: 0.01\n* `--min-height` (FLOAT): Minimum height of TM-score signals to be processed. Default: 0.4\n* `--keep-temp / --no-keep-temp`: Whether to keep the temporary directory and files. Default: no-keep-temp\n* `--pymol-pse / --no-pymol-pse`: Whether to create and output PyMOL session files. Default: no-pymol-pse\n* `--help`: Show this message and exit\n\n## download-pdb\n\n### Arguments\n* `pdb_id` (TEXT): PDB ID of the experimental structure to download and query. This argument is required. Default: None\n* `out_dir` (TEXT): Path to the output directory. This argument is required. Default: None\n\n### Options\n* `--chain` (TEXT): Specific chain to query from the structures. Default: all\n* `--temp-dir` (TEXT): Path to the temporary directory. Default: /tmp\n* `--max-eval` (FLOAT): Maximum E-value of the targets to prefilter. Default: 0.01\n* `--min-height` (FLOAT): Minimum height of TM-score signals to be processed. Default: 0.4\n* `--keep-temp / --no-keep-temp`: Whether to keep the temporary directory and files. Default: no-keep-temp\n* `--pymol-pse / --no-pymol-pse`: Whether to create and output PyMOL session files. Default: no-pymol-pse\n* `--help`: Show this message and exit\n\n## download-model\n\n### Arguments\n* `uniprot_id` (TEXT): UniProt ID of the AlphaFold-predicted model to download and query. This argument is required. Default: None\n* `af_version` (TEXT): Version of AlphaFold to download predicted models from. This argument is required. Default: None\n* `out_dir` (TEXT): Path to the output directory. This argument is required. Default: None\n\n### Options\n* `--temp-dir` (TEXT): Path to the temporary directory. Default: /tmp\n* `--max-eval` (FLOAT): Maximum E-value of the targets to prefilter. Default: 0.01\n* `--min-height` (FLOAT): Minimum height of TM-score signals to be processed. Default: 0.4\n* `--keep-temp / --no-keep-temp`: Whether to keep the temporary directory and files. Default: no-keep-temp\n* `--pymol-pse / --no-pymol-pse`: Whether to create and output PyMOL session files. Default: no-pymol-pse\n* `--help`: Show this message and exit\n\n## Note\nTo generate and output PyMOL sessions using the --pymol-pse option, you must have PyMOL installed. You can install PyMOL using conda with one of the following commands, or compile it from source (https://www.pymol.org/):\n```\nconda install -c conda-forge -c schrodinger pymol-bundle=2.6\nconda install -c conda-forge pymol-open-source\n```\n\n## Examples\n\nIf you already have a PDB/CIF formatted structure file, and you want to query all the chains in the structure, keeping temporary directory and files:\n```\npython3 ./bin/strpsearch.py query-file /input/file /output/directory --keep-temp\n```\n\nIf you want to automatically download and query a specific experimental structure from PDB (e.g. chain B of PDB structure 1A0R), without keeping temporary directory and files:\n```\npython3 ./bin/strpsearch.py download-pdb 1a0r /output/directory --chain B\n```\n\nIf you want to automatically download and query a predicted-model from AlphaFold (e.g. UniProt ID: Q9HXJ7)\n```\npython3 ./bin/strpsearch.py download-model Q9HXJ7 /output/directory \n```\n\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fbiocomputingup%2Fstrpsearch","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fbiocomputingup%2Fstrpsearch","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fbiocomputingup%2Fstrpsearch/lists"}