https://github.com/fomightez/structurework

programs and scripts for molecular structure analysis

Host: GitHub
URL: https://github.com/fomightez/structurework
Owner: fomightez
Created: 2014-06-20T16:17:08.000Z (almost 11 years ago)
Default Branch: master
Last Pushed: 2025-03-03T02:07:33.000Z (3 months ago)
Last Synced: 2025-03-26T10:52:20.436Z (2 months ago)
Language: Python
Size: 450 KB
Stars: 10
Watchers: 1
Forks: 10
Open Issues: 0
Metadata Files:
- Readme: README.md

README

structurework
=============

Repository for programs and [Python scripts](https://github.com/fomightez/structurework/tree/master/python_scripts) for molecular structure analysis.

See the sub-folders in this repository for specifics. (Sorely need to improve on this.)

Related 'Binderized' Utilities
----------------------------

Collection of links to launchable Jupyter environments where various structure/function analysis tools work. Many of my recent scripts are built with use in these environments in mind:

- [bio3d-binder](https://github.com/fomightez/bio3d-binder) - launchable, working Jupyter-based environment with the Bio3D package for Macromolecular Structure Analysis running in R+Jupyter (RStudio is an option there, too) with some examples (*R-based*).

- [cl_demo-binder](https://github.com/fomightez/cl_demo-binder) - launchable, working Jupyter-based environment that has a collection of demonstrations of useful resources on the command line (or useable in Jupyter sessions) for manipulating structure files.

- [PDBmodelComparator](https://github.com/fomightez/PDBmodelComparator) - launchable, working Jupyter-based environment that has a collection of demonstrations of resources for generating overviews comparing models of a series of related macromolecular structures available in the Protein Data Bank. If you have two or more models of essentially the same structure and are wondering which one has the information you seek, this tool is for you. Additionally, it makes a separate summary of which structure has the most of each chain represented. The information incorporated can be customized with additional knowledge. It allows you to quickly & programmatically update the summaries as more related structures are solved.

- [pdbsum-binder](https://github.com/fomightez/pdbsum-binder) - launchable, working Jupyter-based environment that has a collection of demonstrations of my useful resources for analyzing data from [PDBsum](http://www.ebi.ac.uk/thornton-srv/databases/cgi-bin/pdbsum/GetPage.pl?pdbcode=index.html). The [pdbsum-utilities sub-folder in this repo](https://github.com/fomightez/structurework/tree/master/pdbsum_utilities) is related, as it hosts many of the scripts and code backing the data analyses.

- [PDBrenum](https://github.com/fomightez/PDBrenum) - So far I have made a repo [here](https://github.com/fomightez/PDBrenum) where the main software is already installed and runs the pipeline in sessions served by MyBinder.org. I placed a demonstration notebook `demo.ipynb` in there in addition to the `PDBrenum.ipynb` the authors provided. (I actually haven't discerned the purpose of that notebook yet; maybe it is clear in the article?) My demo notebook steps through using it in the sessions. Additionally, I give advice on how you can use `PDBrenum.py` to map chain IDs in PDB files to UniProt IDs [here](https://www.biostars.org/p/9540519/#9540582), relying on the SIFTS data which underlies the `PDBrenum.py` process, and have expanded that to a demonstration notebook, `chainID_mapping_to_UniProt_id_demo.ipynb`.

- [pdbepisa-binder](https://github.com/fomightez/pdbepisa-binder) - Analysis of PDBePISA-related data via active Jupyter sessions provided via MyBinder.org. Adapt the demonstrations to analyze your favorite structures.

- [pymol-binder](https://github.com/fomightez/pymol-binder)

- [Python_basics_on_PDB_file](https://github.com/fomightez/Python_basics_on_PDB_file)
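
As a rough illustration of the chain-ID-to-UniProt mapping idea mentioned in the PDBrenum entry above, here is a minimal sketch. It is not the code from `PDBrenum.py` or from the demonstration notebook. It assumes a SIFTS-style flat file with `PDB`, `CHAIN`, and `SP_PRIMARY` columns (the layout I believe `pdb_chain_uniprot.csv` uses; check the real file before relying on this). The function name `chain_to_uniprot` and all rows in `SAMPLE` are made up for the example.

```python
import csv
import io

# Made-up rows in the ASSUMED SIFTS-style column layout; the IDs are
# placeholders and do not describe real structure/protein associations.
SAMPLE = """\
# comment line, as such files may start with one
PDB,CHAIN,SP_PRIMARY,RES_BEG,RES_END,PDB_BEG,PDB_END,SP_BEG,SP_END
1abc,A,P00001,1,120,1,120,25,144
1abc,B,P00002,1,80,1,80,1,80
2xyz,A,P00003,1,50,1,50,1,50
"""


def chain_to_uniprot(csv_text, pdb_id):
    """Return {chain ID: [UniProt accessions]} for one PDB entry.

    Hypothetical helper: skips '#' comment lines, then reads the rest
    as CSV with a header row.
    """
    rows = (ln for ln in io.StringIO(csv_text) if not ln.startswith("#"))
    mapping = {}
    for row in csv.DictReader(rows):
        if row["PDB"].lower() == pdb_id.lower():
            # A chain can map to more than one accession (e.g. chimeras).
            mapping.setdefault(row["CHAIN"], []).append(row["SP_PRIMARY"])
    return mapping


result = chain_to_uniprot(SAMPLE, "1abc")
print(result)
```

With real data you would pass the text of the downloaded SIFTS file instead of `SAMPLE`.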

Related 'Docker-ized' Utilities
-------

[Dockerfiles for Structural Analysis (Structural Bioinformatics) Docker images](https://github.com/fomightez/Dockerfiles#dockerfiles-for-structural-analysis-structural-bioinformatics-docker-images).

Related non-Python tips
-----------------------

>"Learned a handy pair of grep flags - "grep -F -f". I find it useful for grabbing the intersection of two #cryoem particle star files, e.g. from masked classification of different domains. (here particle list is just the rlnImageName column of class1.star )" [Source](https://twitter.com/OliBClarke/status/1100400145286524928)

    grep -F -f particle_list_class1 class2.star >& class12_instersect.star &
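
For use inside a Jupyter session, the same kind of filtering can be sketched in Python. This is only an approximation of what `grep -F -f` does: `-f` reads patterns from a file, `-F` treats them as fixed strings rather than regular expressions, and a line is kept if it contains any pattern as a substring. The function name `fixed_string_filter` and the sample particle names are made up for the example. Unlike grep, this sketch ignores empty pattern lines.

```python
def fixed_string_filter(patterns, lines):
    """Keep each line that contains at least one pattern as a literal substring."""
    wanted = [p for p in patterns if p]  # ignore empty patterns (grep would not)
    return [line for line in lines if any(p in line for p in wanted)]


# Made-up stand-ins for the contents of the two files.
particle_list_class1 = [
    "000001@Extract/job010/mic_0001.mrcs",
    "000002@Extract/job010/mic_0001.mrcs",
]
class2_star_lines = [
    "data_particles",
    "loop_",
    "_rlnImageName #1",
    "000002@Extract/job010/mic_0001.mrcs 1.5",
    "000007@Extract/job010/mic_0003.mrcs 2.0",
]

shared = fixed_string_filter(particle_list_class1, class2_star_lines)
print(shared)
```

As with the grep command, only matching lines are kept, so the star file header lines are not carried over in this sketch.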

Related resources by others
---------------------------

- [rna-tools (previously rna-pdb-tools): a toolbox to analyze sequences, structures and simulations of RNA](https://github.com/mmagnus/rna-tools/blob/master/index-of-tools.md)

- [PDBrenum: A webserver and program providing PDB renumbered according to their UniProt sequences](http://dunbrack3.fccc.edu/PDBrenum/). The [associated scientific article](https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0253411) shows how to run it as a Python script in detail. The [Announcement](https://twitter.com/RolandDunbrack/status/1412675616156098562).

- [Converters provided by GEMMI tools in web browser via web assembly](https://project-gemmi.github.io/wasm/) - the site can do the following conversions (more about [Gemmi here](https://github.com/project-gemmi/gemmi)):
  - PDB ➜ mmCIF
  - mmCIF ➜ PDB
  - mmCIF ➜ MTZ
  - MTZ ➜ mmCIF
  - 2 × data files ➜ mmCIF (deposition-ready)

- [bioptools](https://github.com/ACRMGroup/bioptools) has [pdbsplitchains](https://github.com/ACRMGroup/bioptools#pdbsplitchains), which splits a PDB file into separate files for each chain. (See [here](https://www.biostars.org/p/9513505/#9513508).) The package has other scripts with structure-related abilities.

- `exploder.py` by Tom Peacock [here](https://github.com/tp-peacock/pdbTools) looks to be meant to separate chains in a protein complex for visualization purposes, so you can have a scene that looks like the complex has been pulled apart, somewhat like the 'exploded view' that parts manuals often use to show assemblies.

- See more related tools [here](https://github.com/fomightez/structurework/tree/master/python_scripts#related-resources-by-others).
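
To give a feel for what splitting a PDB file by chain involves, here is a minimal plain-Python sketch. It is not how `pdbsplitchains` is implemented. It relies only on the fixed-column PDB format, where the chain ID sits in column 22 of `ATOM` and `HETATM` records. The function name `split_pdb_by_chain` and the coordinate lines are made up for the example. Other record types (`TER`, `MODEL`, headers) are simply ignored.

```python
from collections import defaultdict


def split_pdb_by_chain(pdb_text):
    """Group ATOM/HETATM lines by the chain ID found in column 22."""
    chains = defaultdict(list)
    for line in pdb_text.splitlines():
        # Column 22 (1-based) is index 21 in Python's 0-based indexing.
        if line.startswith(("ATOM  ", "HETATM")) and len(line) > 21:
            chains[line[21]].append(line)
    return dict(chains)


# Made-up coordinate records, laid out in PDB fixed columns.
EXAMPLE_PDB = "\n".join([
    "HEADER    MADE-UP EXAMPLE",
    "ATOM      1  N   MET A   1      11.104  13.207   2.100",
    "ATOM      2  CA  MET A   1      12.560  13.300   2.400",
    "TER",
    "ATOM      3  N   GLY B   1      20.000  21.000  22.000",
    "HETATM    4  O   HOH B 101      25.000  26.000  27.000",
    "END",
])

chains = split_pdb_by_chain(EXAMPLE_PDB)
for chain_id, records in sorted(chains.items()):
    print(chain_id, len(records))
```

Each list in `chains` could then be written to its own file, one per chain ID.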
