{"id":13416302,"url":"https://github.com/ujjwalkarn/DataSciencePython","last_synced_at":"2025-03-14T23:31:30.567Z","repository":{"id":37663853,"uuid":"43759462","full_name":"ujjwalkarn/DataSciencePython","owner":"ujjwalkarn","description":"common data analysis and machine learning tasks using python","archived":false,"fork":false,"pushed_at":"2024-04-03T10:37:58.000Z","size":83,"stargazers_count":5263,"open_issues_count":11,"forks_count":1498,"subscribers_count":348,"default_branch":"master","last_synced_at":"2024-10-29T15:34:37.693Z","etag":null,"topics":["data-science","data-scientists","python","python-tutorial"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/ujjwalkarn.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2015-10-06T15:26:08.000Z","updated_at":"2024-10-28T05:46:20.000Z","dependencies_parsed_at":"2022-07-10T13:16:44.427Z","dependency_job_id":"eeb289f1-7839-48b9-afb7-83b6981b464d","html_url":"https://github.com/ujjwalkarn/DataSciencePython","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ujjwalkarn%2FDataSciencePython","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ujjwalkarn%2FDataSciencePython/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ujjwalkarn%2FDataSciencePython/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ujjwalkarn%2FDataSciencePython/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/ujjwalkarn","download_url":"https://codeload.github.com/ujjwalkarn/DataSciencePython/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":243663514,"owners_count":20327300,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["data-science","data-scientists","python","python-tutorial"],"created_at":"2024-07-30T21:00:56.693Z","updated_at":"2025-03-14T23:31:30.544Z","avatar_url":"https://github.com/ujjwalkarn.png","language":"Python","readme":"# Python Data Science Tutorials \n- This repo contains a curated list of Python tutorials for Data Science, NLP and Machine Learning.\n\n- [**Curated list of R tutorials for Data Science, NLP and Machine Learning**](https://github.com/ujjwalkarn/DataScienceR).\n\n- [Comprehensive topic-wise list of Machine Learning and Deep Learning tutorials, codes, articles and other resources](https://github.com/ujjwalkarn/Machine-Learning-Tutorials/blob/master/README.md).\n\n## The Python Language\n- [Python 3 in one picture](https://fossbytes.com/wp-content/uploads/2015/09/python-3-in-one-pic.png)\n- [**Awesome Python**](https://github.com/vinta/awesome-python)\n- [**Jargon from the functional programming world in simple terms!**](https://github.com/hemanth/functional-programming-jargon)\n- [**Dive Into Python**](http://www.diveintopython.net/index.html)\n- [Learn Python Wiki on Reddit](https://www.reddit.com/r/learnpython/wiki/index)\n- [Learn 90% of Python in 90 Minutes](https://www.slideshare.net/MattHarrison4/learn-90)\n- [Highest Voted Python Questions](http://stackoverflow.com/questions/tagged/python?sort=votes\u0026pageSize=50)\n- [Python Basic Concepts](https://github.com/gumption/Python_for_Data_Science/blob/master/3_Python_Basic_Concepts.ipynb)\n- [Quick Reference to Python](http://www.dataschool.io/python-quick-reference/)\n- [The Elements of Python Style](https://github.com/amontalenti/elements-of-python-style)\n- [**What does the yield keyword do in Python?**](http://stackoverflow.com/questions/231767/what-does-the-yield-keyword-do-in-python)\n- [Parsing values from a JSON file in Python](http://stackoverflow.com/questions/2835559/parsing-values-from-a-json-file-in-python)\n- [**Python Quora FAQs**](https://www.quora.com/topic/Python-programming-language-1)\n- [time-complexity of various operations - list/dict - in current CPython](https://wiki.python.org/moin/TimeComplexity)\n- Scripting in Python\n    - [Python Scripting Tutorial](http://www.dreamsyssoft.com/python-scripting-tutorial/intro-tutorial.php)\n    - [Scripting with Python](https://www.schrodinger.com//AcrobatFile.php?type=supportdocs\u0026type2=\u0026ident=404)\n    - [**Can I use Python as a bash replacement?**](http://stackoverflow.com/questions/209470/can-i-use-python-as-a-bash-replacement)\n\n## Useful Online Courses\n- [Learn Python (Codecademy)](https://www.codecademy.com/learn/python#)\n- [Free Interactive Course: Intro to Python for Data Science (DataCamp)](https://www.datacamp.com/courses/intro-to-python-for-data-science)\n- [Introduction to Computer Science and Programming Using Python (MIT)](https://www.edx.org/course/introduction-computer-science-mitx-6-00-1x-11)\n- [Python for Everybody](https://www.coursera.org/learn/python)\n- [Python Programming Essentials](https://www.coursera.org/learn/python-programming)\n\n## Data Science with Python\n- [**Data Science IPython Notebooks**](https://github.com/donnemartin/data-science-ipython-notebooks)\n- [Awesome Python - Data Analysis](https://github.com/vinta/awesome-python#science-and-data-analysis)\n- Statistics\n  - [Statistics and Data Science](https://github.com/svaksha/pythonidae/blob/master/Statistics.md)\n- [**An Introduction to Scientific Python (and a Bit of the Maths Behind It) – NumPy**](http://www.kdnuggets.com/2016/06/intro-scientific-python-numpy.html)\n- [Data Analysis and IPython Notebooks](https://github.com/kirang89/pycrumbs#data-analysis)\n- [Python for Data Science: Basic Concepts](https://github.com/gumption/Python_for_Data_Science/blob/master/2_Data_Science_Basic_Concepts.ipynb)\n- [Pycon India 2015 Notes](http://www.analyticsvidhya.com/blog/2015/10/notes-impressions-experience-excitement-pycon-india-2015/)\n- [**5 important Python Data Science advancements of 2015**](https://medium.com/@elgehelge/the-5-most-important-python-data-science-advancements-of-2015-a136482da89b#.sp2c1la9z)\n- [Data Exploration with Numpy cheat sheet](http://www.analyticsvidhya.com/blog/2015/07/11-steps-perform-data-analysis-pandas-python)\n- [Querying Craiglist with Python](http://chrisholdgraf.com/querying-craigslist-with-python/?imm_mid=0d8940\u0026cmp=em-data-na-na-newsltr_20150916)\n- [**An introduction to Numpy and Scipy**](http://www.engr.ucsb.edu/~shell/che210d/numpy.pdf)\n- [Create NBA Shot Charts](http://savvastjortjoglou.com/nba-shot-sharts.html)\n- [PythoR- Python meets R](http://nipunbatra.github.io/2016/01/pythor/)\n- [**How do I learn data analysis with Python?**](https://www.quora.com/How-do-I-learn-data-analysis-with-Python?redirected_qid=2464720)\n- [What are some interesting things to do with Python?](https://www.quora.com/Python-programming-language-What-are-some-interesting-things-to-do-with-Python?redirected_qid=2324227)\n- [**Which is better for data analysis: R or Python?**](https://www.quora.com/Which-is-better-for-data-analysis-R-or-Python)\n- [**Web scraping in Python**](https://github.com/ujjwalkarn/Web-Scraping)\n- [The Guide to Learning Python for Data Science](http://www.datasciencecentral.com/profiles/blogs/the-guide-to-learning-python-for-data-science-2)\n- [Python For Data Science - A Cheat Sheet For Beginners](https://www.datacamp.com/community/tutorials/python-data-science-cheat-sheet-basics)\n- [Top voted Python data science questions](http://datascience.stackexchange.com/questions/tagged/python)\n- [Awesome Python - Data Visualization](https://github.com/vinta/awesome-python#data-visualization)\n- [Awesome Python - Map Reduce](https://github.com/vinta/awesome-python#mapreduce)\n\n## Pandas Library in Python\n- [Intro to pandas data structures](http://www.gregreda.com/2013/10/26/intro-to-pandas-data-structures/)\n- [Useful Pandas Cheatsheet](https://github.com/pandas-dev/pandas/blob/master/doc/cheatsheet/Pandas_Cheat_Sheet.pdf)\n- [An Introduction to Scientific Python – Pandas](http://www.datadependence.com/2016/05/scientific-python-pandas/)\n- [10 minutes to Pandas](http://pandas.pydata.org/pandas-docs/stable/10min.html)\n- [Useful Pandas Snippets](http://www.swegler.com/becky/blog/2014/08/06/useful-pandas-snippets/)\n- [Timeseries analysis using Pandas](http://nbviewer.jupyter.org/github/twiecki/financial-analysis-python-tutorial/blob/master/1.%20Pandas%20Basics.ipynb)\n- [Pandas Exercises - Practice your Pandas skills](https://github.com/guipsamora/pandas_exercises)\n- [Grouping in Pandas](http://blog.yhat.com/posts/grouping-pandas.html)\n- [**“Large data” work flows using pandas**](http://stackoverflow.com/questions/14262433/large-data-work-flows-using-pandas)\n- [Easier data analysis with pandas (video series)](http://www.dataschool.io/easier-data-analysis-with-pandas/)\n- [Pandas Basics Cheat Sheet](https://www.datacamp.com/community/blog/python-pandas-cheat-sheet)\n- Quick Operations on a Pandas DataFrame\n    - [Renaming Columns in Pandas](http://stackoverflow.com/questions/11346283/renaming-columns-in-pandas) ([video](https://www.youtube.com/watch?v=0uBirYFhizE\u0026list=PL5-da3qGB5ICCsgW1MxlZ0Hq8LL5U3u9y\u0026index=5))\n    - [Deleting Columns from pandas DataFrame](http://stackoverflow.com/questions/13411544/delete-column-from-pandas-dataframe) ([video](https://www.youtube.com/watch?v=gnUKkS964WQ\u0026list=PL5-da3qGB5ICCsgW1MxlZ0Hq8LL5U3u9y\u0026index=6))\n    - [Adding new Column to existing DataFrame](http://stackoverflow.com/questions/12555323/adding-new-column-to-existing-dataframe-in-python-pandas)\n    - [Add one Row in a pandas.DataFrame](http://stackoverflow.com/questions/10715965/add-one-row-in-a-pandas-dataframe)\n    - [Changing the order of DataFrame Columns](http://stackoverflow.com/questions/13148429/how-to-change-the-order-of-dataframe-columns)\n    - [Changing data type of Columns](http://stackoverflow.com/questions/15891038/pandas-change-data-type-of-columns) ([video](https://www.youtube.com/watch?v=V0AWyzVMf54\u0026list=PL5-da3qGB5ICCsgW1MxlZ0Hq8LL5U3u9y\u0026index=13))\n    - [Getting a list of the column headers from a DataFrame](http://stackoverflow.com/questions/19482970/get-list-from-pandas-dataframe-column-headers)\n    - [Converting list of dictionaries to Dataframe](http://stackoverflow.com/questions/20638006/convert-list-of-dictionaries-to-dataframe)\n    - [Getting row count of pandas DataFrame](http://stackoverflow.com/questions/15943769/how-to-get-row-count-of-pandas-dataframe)\n    - [Most efficient way to loop through DataFrames](http://stackoverflow.com/questions/7837722/what-is-the-most-efficient-way-to-loop-through-dataframes-with-pandas)\n    - [Deleting DataFrame row based on column value](http://stackoverflow.com/questions/18172851/deleting-dataframe-row-in-pandas-based-on-column-value)\n    - [Dropping a list of rows from Pandas DataFrame](http://stackoverflow.com/questions/14661701/how-to-drop-a-list-of-rows-from-pandas-dataframe)\n    - [Sorting a DataFrame or a single column](https://www.youtube.com/watch?v=zY4doF6xSxY\u0026list=PL5-da3qGB5ICCsgW1MxlZ0Hq8LL5U3u9y\u0026index=7)\n    - [Filtering DataFrame rows by column value](https://www.youtube.com/watch?v=2AFGPdNn4FM\u0026list=PL5-da3qGB5ICCsgW1MxlZ0Hq8LL5U3u9y\u0026index=8)\n    - [Filtering DataFrame rows using multiple criteria](https://www.youtube.com/watch?v=YPItfQ87qjM\u0026list=PL5-da3qGB5ICCsgW1MxlZ0Hq8LL5U3u9y\u0026index=9)\n    - [Dropping all non-numeric columns from a DataFrame](https://youtu.be/B-r9VuK80dk?t=4m31s)\n    - [Counting and removing missing values](https://www.youtube.com/watch?v=fCMrO_VzeL8\u0026list=PL5-da3qGB5ICCsgW1MxlZ0Hq8LL5U3u9y\u0026index=16)\n    - [Selecting multiple rows and columns from a DataFrame](https://www.youtube.com/watch?v=xvpNA7bC8cs\u0026list=PL5-da3qGB5ICCsgW1MxlZ0Hq8LL5U3u9y\u0026index=19)\n    - [Reducing the size of a DataFrame](https://www.youtube.com/watch?v=wDYDYGyN_cw\u0026list=PL5-da3qGB5ICCsgW1MxlZ0Hq8LL5U3u9y\u0026index=21)\n\n## Machine Learning with Python\n- [AI, ML Related List](https://github.com/svaksha/pythonidae/blob/master/AI.md)\n- [Data Normalization in Python](http://blog.yhat.com/posts/data-normalization-in-python.html)\n- [**Python Machine Learning Book**](https://github.com/rasbt/python-machine-learning-book)\n- [Table of Contents and Code Notebooks](https://github.com/rasbt/python-machine-learning-book/blob/master/README.md#table-of-contents-and-code-notebooks)\n- [Machine Learning with scikit learn](http://www.dataschool.io/machine-learning-with-scikit-learn/)\n- [Machine Learning Algorithms Cheatsheet](http://www.analyticsvidhya.com/blog/2015/09/full-cheatsheet-machine-learning-algorithms/)\n- [**How to compute precision, recall, accuracy and f1-score for the multiclass case with scikit learn?**](http://stackoverflow.com/questions/31421413/how-to-compute-precision-recall-accuracy-and-f1-score-for-the-multiclass-case)\n- [One Hot Encoding for Machine learning in Python](http://stackoverflow.com/questions/17469835/one-hot-encoding-for-machine-learning)\n- [**Building a (semi) Autonomous Drone with Python**](http://blog.yhat.com/posts/autonomous-droning-with-python.html)\n- [Awesome Python - Machine Learning](https://github.com/vinta/awesome-python#machine-learning)\n- Computer Vision\n  - [Awesome Python - Computer Vision](https://github.com/vinta/awesome-python#computer-vision)\n\n## Scikit Learn\n- [scikit learn on Wikipedia](https://en.wikipedia.org/wiki/Scikit-learn)\n- [**Introduction to machine learning with scikit-learn**](https://github.com/justmarkham/scikit-learn-videos), [**Videos!**](http://blog.kaggle.com/author/kevin-markham/)\n- [**A Gentle Introduction to Scikit-Learn: A Python Machine Learning Library**](http://machinelearningmastery.com/a-gentle-introduction-to-scikit-learn-a-python-machine-learning-library/)\n- [**PyData Seattle 2015 Scikit-learn Tutorial**](https://github.com/jakevdp/sklearn_pydata2015), [sklearn_scipy2013](https://github.com/jakevdp/sklearn_scipy2013)\n- [SKLEARN BENCHMARKS: A centralized repository to report scikit-learn model performance across a variety of parameter settings and data sets](https://github.com/rhiever/sklearn-benchmarks), [Report results of sklearn benchmarks at openml.org](http://www.openml.org/)\n- [How to get most informative features for scikit-learn classifiers?](http://stackoverflow.com/questions/11116697/how-to-get-most-informative-features-for-scikit-learn-classifiers)\n- [**Code example to predict prices of Airbnb vacation rentals, using scikit-learn on Spark**](https://github.com/mapr-demos/spark-sklearn-airbnb-predict)\n- [**Machine Learning with scikit learn tutorial**](http://amueller.github.io/sklearn_tutorial/)\n- [Parallel and Large Scale Machine Learning with scikit-learn](https://speakerdeck.com/ogrisel/parallel-and-large-scale-machine-learning-with-scikit-learn), [Meetup](http://datasciencelondon.org/machine-learning-python-scikit-learn-ipython-dsldn-data-science-london-kaggle/)\n- [Saving classifier to disk in scikit-learn](http://stackoverflow.com/questions/10592605/save-classifier-to-disk-in-scikit-learn)\n\n\n## Linear Regression in Python\n- [Linear Regression in Python](http://nbviewer.ipython.org/github/justmarkham/DAT4/blob/master/notebooks/08_linear_regression.ipynb), [Blog Post](http://www.dataschool.io/linear-regression-in-python/)\n- [Linear Regression using Scikit Learn](http://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LinearRegression.html)\n- [A friendly introduction to linear regression (using Python)](http://www.dataschool.io/linear-regression-in-python/)\n- [Linear Regression Example in Python](http://scipy-cookbook.readthedocs.io/items/LinearRegression.html)\n- [Regression analysis using Python StatsModels package](http://www.turingfinance.com/regression-analysis-using-python-statsmodels-and-quandl/)\n- [Run an OLS regression with Pandas Data Frame](http://stackoverflow.com/questions/19991445/run-an-ols-regression-with-pandas-data-frame)\n\n## Logistic Regression in Python\n- [Logistic Regression with scikit learn](http://www.dataschool.io/logistic-regression-in-python-using-scikit-learn/)\n- [Logistic Regression in Python](http://blog.yhat.com/posts/logistic-regression-and-python.html)\n- [Implementing the softmax function in Python](http://stackoverflow.com/questions/34968722/softmax-function-python)\n- [**What is the inverse of regularization strength in Logistic Regression? How should it affect my code?**](http://stackoverflow.com/questions/22851316/what-is-the-inverse-of-regularization-strength-in-logistic-regression-how-shoul)\n- [The Yhat Blog: Logistic Regression in Python](http://blog.yhat.com/posts/logistic-regression-and-python.html)\n- [Example of logistic regression in Python using scikit-learn](http://www.dataschool.io/logistic-regression-in-python-using-scikit-learn/)\n- [TUTORIAL ON LOGISTIC REGRESSION AND OPTIMIZATION IN PYTHON](https://learningwithdata.wordpress.com/2015/04/30/tutorial-on-logistic-regression-and-optimization-in-python/)\n- [Using Logistic Regression in Python for Data Science](http://www.dummies.com/how-to/content/using-logistic-regression-in-python-for-data-scien.html)\n\n## k Nearest Neighbours in Python\n- [A good tutorial on implementing K Nearest Neighbors using scikit learn](http://scikit-learn.org/stable/modules/neighbors.html)\n- [**Is it possible to specify your own distance function using scikit-learn K-Means Clustering?**](http://stackoverflow.com/questions/5529625/is-it-possible-to-specify-your-own-distance-function-using-scikit-learn-k-means)\n- [Tutorial To Implement k-Nearest Neighbors in Python From Scratch](http://machinelearningmastery.com/tutorial-to-implement-k-nearest-neighbors-in-python-from-scratch/)\n- [Implementing your own k-nearest neighbour algorithm using Python](https://blog.cambridgecoding.com/2016/01/16/machine-learning-under-the-hood-writing-your-own-k-nearest-neighbour-algorithm/)\n- [knn Python implementation on StackOverflow](http://stackoverflow.com/questions/5565935/k-nearest-neighbour-in-python)\n- [kNN with big sparse matrices in Python](http://stackoverflow.com/questions/20333092/knn-with-big-sparse-matrices-in-python)\n- [Sklearn kNN usage with a user defined metric](http://stackoverflow.com/questions/21052509/sklearn-knn-usage-with-a-user-defined-metric)\n\n\n## Neural Networks in Python\n- [Implementing a Neural Network from scratch in Python](http://www.wildml.com/2015/09/implementing-a-neural-network-from-scratch/), [Code](https://github.com/dennybritz/nn-from-scratch)\n- [A Neural Network in 11 lines of Python](http://iamtrask.github.io/2015/07/12/basic-python-network/)\n- [Speeding up your Neural Network with Theano and the gpu](http://www.wildml.com/2015/09/speeding-up-your-neural-network-with-theano-and-the-gpu/), [Code](https://github.com/dennybritz/nn-theano)\n- [What is the best neural network library for Python?](https://www.quora.com/What-is-the-best-neural-network-library-for-Python)\n- [Recurrent Neural Net Tutorial in Python Part 1](http://www.wildml.com/2015/09/recurrent-neural-networks-tutorial-part-1-introduction-to-rnns/), [Part 2](http://www.wildml.com/2015/09/recurrent-neural-networks-tutorial-part-2-implementing-a-language-model-rnn-with-python-numpy-and-theano/), [Code](https://github.com/dennybritz/rnn-tutorial-rnnlm/)\n- [PyBrain: modular Machine Learning Library for Python](http://pybrain.org/)\n- [Neural Networks Tutorial – a Pathway to Deep Learning](http://www.adventuresinmachinelearning.com/neural-networks-tutorial/)\n\n\n## Decision Trees in Python\n- [How to extract the decision rules from scikit-learn decision-tree?](http://stackoverflow.com/questions/20224526/how-to-extract-the-decision-rules-from-scikit-learn-decision-tree)\n- [**How do I find which attributes my tree splits on, when using scikit-learn?**](http://stackoverflow.com/questions/20156951/how-do-i-find-which-attributes-my-tree-splits-on-when-using-scikit-learn)\n- [Quora: What is a good Python library for decision trees?](https://www.quora.com/What-is-a-good-Python-library-for-decision-trees), [StackOverflow](http://stackoverflow.com/questions/3127922/what-is-a-good-python-library-for-decision-trees) \n- [Building Decision Trees in Python](http://www.onlamp.com/pub/a/python/2006/02/09/ai_decision_trees.html?page=1)\n- [Pure Python Decision Trees](http://kldavenport.com/pure-python-decision-trees/)\n- [Building a decision tree from scratch in Python - a beginner's tutorial](http://www.patricklamle.com/Tutorials/Decision%20tree%20python/tuto_decision%20tree.html)\n- [Using Python to Build and Use a Simple Decision Tree Classifier](https://github.com/gumption/Python_for_Data_Science/blob/master/4_Python_Simple_Decision_Tree.ipynb)\n- [Decision trees in python with scikit-learn and pandas](http://chrisstrelioff.ws/sandbox/2015/06/08/decision_trees_in_python_with_scikit_learn_and_pandas.html)\n- [Code for simple decision tree in Python](https://github.com/gumption/Python_for_Data_Science/blob/master/simple_decision_tree.py)\n- [Lesson notebook: Regression and Classification Trees](http://nbviewer.jupyter.org/github/justmarkham/DAT8/blob/master/notebooks/17_decision_trees.ipynb)\n- [Discover structure behind data with decision trees](http://vooban.com/en/tips-articles-geek-stuff/discover-structure-behind-data-with-decision-trees/)\n\n## Random Forest with Python\n- [Getting Started with Random Forests: Titanic Competition on Kaggle](https://www.kaggle.com/c/titanic/details/getting-started-with-random-forests), [Python sample code](https://www.kaggle.com/c/digit-recognizer/forums/t/2299/getting-started-python-sample-code-random-forest)\n- [RandomForestClassifier vs ExtraTreesClassifier in scikit learn](http://stackoverflow.com/questions/22409855/randomforestclassifier-vs-extratreesclassifier-in-scikit-learn)\n- [Powerful Guide to learn Random Forest](http://www.analyticsvidhya.com/blog/2015/09/random-forest-algorithm-multiple-challenges/)\n- [How are Feature Importances in RandomForestClassifier determined?](http://stackoverflow.com/questions/15810339/how-are-feature-importances-in-randomforestclassifier-determined)\n- [Random forest interpretation with scikit-learn](http://blog.datadive.net/random-forest-interpretation-with-scikit-learn/)\n- [Random Forests in Python Tutorial](http://blog.yhat.com/posts/random-forests-in-python.html)\n- [Unbalanced classification using RandomForestClassifier in sklearn](http://stackoverflow.com/questions/20082674/unbalanced-classification-using-randomforestclassifier-in-sklearn)\n- [Random Forest with categorical features in sklearn](http://stackoverflow.com/questions/24715230/random-forest-with-categorical-features-in-sklearn)\n- [How to output RandomForest Classifier from python?](http://stackoverflow.com/questions/23000693/how-to-output-randomforest-classifier-from-python)\n- [Lesson notebook: Ensembling, Bagging, and Random Forests](http://nbviewer.jupyter.org/github/justmarkham/DAT8/blob/master/notebooks/18_ensembling.ipynb)\n\n## Support Vector Machine in Python\n- [Fastest SVM implementation usable in Python](http://stackoverflow.com/questions/9299346/fastest-svm-implementation-usable-in-python)\n- [An example using python bindings for SVM library, LIBSVM](http://stackoverflow.com/questions/4214868/an-example-using-python-bindings-for-svm-library-libsvm)\n- [What is the best SVM library usable from Python?](https://www.quora.com/What-is-the-best-SVM-library-usable-from-Python)\n- [How does sklearn.svm.svc's function predict_proba() work internally?](http://stackoverflow.com/questions/15111408/how-does-sklearn-svm-svcs-function-predict-proba-work-internally)\n- [Support vector machine in Python using libsvm example of features](http://stackoverflow.com/questions/30991592/support-vector-machine-in-python-using-libsvm-example-of-features)\n- [Linear SVC Machine learning SVM example with Python](https://pythonprogramming.net/linear-svc-example-scikit-learn-svm-python/)\n- [Understanding Support Vector Machine algorithm from examples (along with code)](http://www.analyticsvidhya.com/blog/2015/10/understaing-support-vector-machine-example-code/)\n\n## NLP / Text Mining in Python\n- [**NLP with Python ORiley Book**](http://www.nltk.org/book_1ed/), [Python 3](http://www.nltk.org/book/)\n- [Awesome Python - NLP](https://github.com/vinta/awesome-python#natural-language-processing)\n- [Awesome Python - Text Processing](https://github.com/vinta/awesome-python#text-processing)\n- [Text Analytics : Intro and Tokenization](http://a4analytics.blogspot.sg/2015/03/text-mining-post-1.html)\n- [NLTK BOOK](http://www.nltk.org/book/ch01.html)\n- [Elegant N-gram Generation in Python](http://locallyoptimal.com/blog/2013/01/20/elegant-n-gram-generation-in-python/)\n- [**Computing N Grams using Python**](http://stackoverflow.com/questions/13423919/computing-n-grams-using-python)\n- [N-grams: Explanation + 2 applications](http://stackoverflow.com/questions/1032288/n-grams-explanation-2-applications)\n- [NLP Tutorial with Python](http://www.datasciencecentral.com/profiles/blogs/python-nlp-tools)\n\n## Sentiment Analysis with Python\n- [A Comprehensive Guide to Sentiment Analysis](https://monkeylearn.com/sentiment-analysis/)\n- [Twitter-Sentiment-Analysis](https://github.com/ujjwalkarn/Twitter-Sentiment-Analysis)\n- [Basic Sentiment Analysis with Python](http://fjavieralba.com/basic-sentiment-analysis-with-python.html)\n- [What is the best way to do Sentiment Analysis with Python?](https://www.quora.com/What-is-the-best-way-to-do-Sentiment-Analysis-with-Python-1)\n- [How to Calculate Twitter Sentiment Using AlchemyAPI with Python](http://www.alchemyapi.com/developers/getting-started-guide/twitter-sentiment-analysis)\n- [Second Try: Sentiment Analysis in Python](http://andybromberg.com/sentiment-analysis-python/)\n- [Sentiment Analysis with Python NLTK Text Classification](http://text-processing.com/demo/sentiment/)\n- Codes and Explanation\n    - [**Sentiment Analysis with bag-of-words**](http://ataspinar.com/2016/01/21/sentiment-analysis-with-bag-of-words/)\n    - [**Sentiment Analysis with Naive Bayes**](http://ataspinar.com/2016/02/15/sentiment-analysis-with-the-naive-bayes-classifier/)\n\n## Pickle: convert a python object into a character stream\n- [Python serialization - Why pickle?](http://stackoverflow.com/questions/8968884/python-serialization-why-pickle)\n- [**Serializing Python Objects**](http://www.diveinto.org/python3/serializing.html), [**Binary Files**](http://www.diveinto.org/python3/files.html#binary)\n- [What is Pickle in python ?](https://pythontips.com/2013/08/02/what-is-pickle-in-python/)\n- [How to cPickle dump and load separate dictionaries to the same file?](http://stackoverflow.com/questions/11641493/how-to-cpickle-dump-and-load-separate-dictionaries-to-the-same-file)\n- [**Understanding Pickling in Python**](http://stackoverflow.com/questions/7501947/understanding-pickling-in-python)\n\n## AutoML\n- [TPOT: A Python tool for automating data science](http://www.randalolson.com/2016/05/08/tpot-a-python-tool-for-automating-data-science/), [GitHub repo](https://github.com/rhiever/tpot)\n\n## Regex Related\n- [RegExr](http://regexr.com/)\n- [Regex101](https://regex101.com/)\n- [Pythex](http://pythex.org/)\n- [How to use Regular Expressions (Regex) in Microsoft Excel both in-cell and loops](http://stackoverflow.com/questions/22542834/how-to-use-regular-expressions-regex-in-microsoft-excel-both-in-cell-and-loops)\n- [Advanced Filters: Excel’s Amazing Alternative To Regex](http://searchengineland.com/advanced-filters-excels-amazing-alternative-to-regex-143680)\n\n## Shell Scripting\n- [**Calling an external command in Python**](http://stackoverflow.com/questions/89228/calling-an-external-command-in-python)\n- [**Running shell command from Python and capturing the output**](http://stackoverflow.com/questions/4760215/running-shell-command-from-python-and-capturing-the-output)\n- [**Can I use Python as a bash replacement?**](http://stackoverflow.com/questions/209470/can-i-use-python-as-a-bash-replacement)\n- [Python Scripts as a Replacement for Bash Utility Scripts](http://www.linuxjournal.com/content/python-scripts-replacement-bash-utility-scripts)\n- [How to Write a Shell Script using Bash Shell in Ubuntu](https://www.youtube.com/watch?v=He-5BpUGSag)\n- Red Hat Magazine | Python for Bash scripters: A well-kept secret\n- [Embed bash in python](http://stackoverflow.com/questions/2651874/embed-bash-in-python)\n- [Bash2py: A Bash to Python Translator](https://cs.uwaterloo.ca/~ijdavis/bash2py-final.pdf)\n- [Beginners/BashScripting](https://help.ubuntu.com/community/Beginners/BashScripting)\n- [The Beginner’s Guide to Shell Scripting: The Basics](http://www.howtogeek.com/67469/the-beginners-guide-to-shell-scripting-the-basics/)\n- [Linux Shell Scripting Tutorial v1.05r3 A Beginner's handbook](http://www.freeos.com/guides/lsst/)\n\n## Other good lists\n- [pycrumbs - Bits and bytes of Python from the Internet](https://github.com/kirang89/pycrumbs)\n- [python github projects - Collect and classify python projects on Github](https://github.com/checkcheckzz/python-github-projects)\n- [python reference - Useful functions, tutorials, and other Python-related things](https://github.com/rasbt/python_reference)\n- [pythonidae - Curated decibans of scientific programming resources in Python](https://github.com/svaksha/pythonidae)\n","funding_links":[],"categories":["Python","Technical","Data Science \u0026 Machine Learning","Uncategorized","Lists of libraries","🐍 Python","More advanced resources and lists","Awesome lists","Data Science","Data Science ##"],"sub_categories":["ramanihiteshc@gmail.com","Uncategorized","Mathematics","Resources","Various other helpful tools and resources","Frameworks / ecosystems","Python"],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fujjwalkarn%2FDataSciencePython","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fujjwalkarn%2FDataSciencePython","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fujjwalkarn%2FDataSciencePython/lists"}