Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

awesome-datascience

Tools that may be useful for web scraping and data science
https://github.com/oiwn/awesome-datascience

  • grab - Grab is a Python web scraping framework.
  • scrapy - An open source, collaborative framework for extracting the data you need from websites in a fast, simple, yet extensible way.
  • phantomjs - PhantomJS is a headless WebKit browser scriptable with a JavaScript API. It has fast, native support for various web standards: DOM handling, CSS selectors, JSON, Canvas, and SVG.
  • SciPy - A Python library with modules for linear algebra, optimization, integration, special functions, signal and image processing, statistics, ODE solvers, and more.
  • pandas - An open source, BSD-licensed library providing high-performance, easy-to-use data structures and data analysis tools for the Python programming language.
  • scikit-learn - Machine learning in Python.