{"id":23787797,"url":"https://github.com/schultzm/cris","last_synced_at":"2025-08-10T16:07:11.189Z","repository":{"id":201757092,"uuid":"71742911","full_name":"schultzm/CRIS","owner":"schultzm","description":"Find CRISPR binding sites","archived":false,"fork":false,"pushed_at":"2016-10-28T12:08:13.000Z","size":5406,"stargazers_count":0,"open_issues_count":10,"forks_count":3,"subscribers_count":0,"default_branch":"master","last_synced_at":"2025-07-23T13:50:17.712Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/schultzm.png","metadata":{"files":{"readme":"ReadMe.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null}},"created_at":"2016-10-24T01:59:08.000Z","updated_at":"2016-10-24T02:02:02.000Z","dependencies_parsed_at":"2024-03-04T12:52:04.811Z","dependency_job_id":null,"html_url":"https://github.com/schultzm/CRIS","commit_stats":null,"previous_names":["schultzm/cris"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/schultzm/CRIS","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/schultzm%2FCRIS","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/schultzm%2FCRIS/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/schultzm%2FCRIS/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/schultzm%2FCRIS/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/schultzm","download_url":"https://codeload.github.com/schultzm/CRIS/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/schultzm%2FCRIS/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":269748045,"owners_count":24469105,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-08-10T02:00:08.965Z","response_time":71,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2025-01-01T15:17:43.729Z","updated_at":"2025-08-10T16:07:11.162Z","avatar_url":"https://github.com/schultzm.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"![alt text][logo]\n\n[logo]: http://i1.wp.com/www.artofthecell.com/wp-content/uploads/2014/06/Art-of-the-Cell-CRISPR-Cas9-in-Complex-with-Guide-RNA-and-target-DNA.jpg \"CRISPR/Cas9\"\n\n\n# Find CRISPR/Cas9 binding sites in a genbank file. \nTo get help:\n```\npython CRIS.py -h\nusage: CRIS.py [-h] -s SEQ_INFILE [-l LENGTH_TARGET_SEQ]\n               [-t THREE_PRIME_CLAMP] [-p PAM_SEQ] [-q FEATURE_QUALIFIER] [-v]\n               [-n | -o]\n\nFind CRISPR/Cas9 target sites in a genbank file. Accepts files with multiple\ngenbank records per file. Author: dr.mark.schultz@gmail.com. Acknowledgements:\nTorsten Seemann, Ian Monk, Timothy P. Stinear.\n\noptional arguments:\n  -h, --help            show this help message and exit\n  -s SEQ_INFILE, --seq_infile SEQ_INFILE\n                        DNA seqs (contigs) to scan (Genbank format)\n  -l LENGTH_TARGET_SEQ, --length_target_seq LENGTH_TARGET_SEQ\n                        Set total length of target sequence NOT including the\n                        PAM sequence. Default=20.\n  -t THREE_PRIME_CLAMP, --three_prime_clamp THREE_PRIME_CLAMP\n                        At the 3' end, how long do you want the clamp sequence\n                        to be? Default=12.\n  -p PAM_SEQ, --PAM_seq PAM_SEQ\n                        Protospacer Adjacent Motif (PAM). Depends on Cas9\n                        species. Default='NGG'.\n  -q FEATURE_QUALIFIER, --feature_qualifier FEATURE_QUALIFIER\n                        Genbank feature qualifier in which to find target\n                        sites. Could be 'gene', 'CDS', 'mRNA' etc. Case-\n                        sensitive, exact spelling required. Default='gene'.\n  -v, --verbose         Verbose on. Default=False\n  -n, --no_overwrite    Do not overwrite output file if it exists.\n  -o, --overwrite       Overwrite output file if it exists, otherwise write\n                        new. Default is to overwrite.\n```\n\nExample usage:\n```\npython CRIS.py -s test_multigbk.gbk\n```\n\n##Explanation of CRIS.py\nReads in multi-record genbank file. \u003cbr\u003e\u003cbr\u003e\nUser sets up the PAM sequence, target length and 3'-clamp length and/or accepts the defaults. \u003cbr\u003e\u003cbr\u003e\nFor each record, searches through the features of type requested on command line (e.g., gene, CDS or mRNA) and finds all full length CRISPR/Cas9 target sequences matching the RegEx.\u003cbr\u003e\u003cbr\u003e\nWithin the feature, CRIS.py assesses for each of the full length CRISPR/Cas9 sequences whether the sequence at the 3'-clamp (specified on command line) is unique throughout the genome.\u003cbr\u003e\u003cbr\u003e\nAfter finding unique hits, CRIS.py assesses whether each of the full length CRISPR/Cas9 target sequences overlaps with other features of the requested type.  If a sequence is unique AND does not overlap other features, it is stored in the candidate list.  Within the candidate list, the GC content of each CRISPR/Cas9 target sequences is calculated.  Sequences with a GC content equal to the maximum GC content in the candidate set are retained.  If only one hit, this is retained as the 'best' match.  If more than one is retained, the CRISPR/Cas9 target sequence closest to the 5' end of the gene is selected as the 'best' match.  The best match is reported.  A summary is printed now and at the end of the run.  In verbose mode, lots of statements are printed as the run progresses.  \u003cbr\u003e\u003cbr\u003e\nThe output of the run is a copy of the input genbank file with all the best hits marked up in the file.  This annotated genbank can be viewed in Artemis etc.  \n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fschultzm%2Fcris","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fschultzm%2Fcris","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fschultzm%2Fcris/lists"}