https://github.com/mkcor/exp-testing

Demo for reading and testing experimental data.
https://github.com/mkcor/exp-testing

Last synced: 5 months ago
JSON representation

Demo for reading and testing experimental data.

Host: GitHub
URL: https://github.com/mkcor/exp-testing
Owner: mkcor
License: cc0-1.0
Created: 2017-10-29T03:32:53.000Z (over 8 years ago)
Default Branch: master
Last Pushed: 2017-11-16T20:47:57.000Z (over 8 years ago)
Last Synced: 2025-02-21T12:44:48.949Z (over 1 year ago)
Language: HTML
Size: 1.47 MB
Stars: 0
Watchers: 2
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

          # Loading, testing, and wrangling experimental data

[![Binder](https://mybinder.org/badge.svg)](https://mybinder.org/v2/gh/mkcor/exp-testing/master)

Guest Lecture at [PHYS 257](https://www.mcgill.ca/study/2017-2018/courses/phys-257)

*November 16, 2017*

## Summary

We showcase the power and flexibility of Pandas (Python library) for analyzing

experimental datasets. We use real data, acquired in a physics experiment

designed to study the flow of superfluid helium in a low-dimensional setup.

Pandas loads the (tabular) data into a DataFrame (2-dimensional labelled data

structure). Pandas lets you handle web-hosted data, heterogeneous data (e.g.,

some columns are numeric and others are character strings), missing data,

comments (e.g., lab notes). Pandas also offers native timeseries support. We

cover some best practices for testing data and statistics (distributions). We

introduce the concept of tidy data, which results from wrangling the raw data

so that they become easy to work with (i.e., transform, visualize, model).

## Local setup

We use cross-platform package manager [conda](https://conda.io/).

We recommend using the [Miniconda](https://conda.io/miniconda.html)

distribution. Once you have downloaded your Miniconda installer, run

the following command (adapt if necessary):

    $ bash ~/Downloads/Miniconda3-latest-Linux-x86_64.sh

and follow the installation steps. Now create a sandboxed environment

for this project:

    $ conda env create -f environment.yml

    $ source activate advanced-pandas

    $ jupyter notebook

If you edit file `environment.yml` (to add or update a dependency), then

run:

    $ conda env update -f environment.yml

## Remote setup

Check out this

[draft](https://www.authorea.com/users/153798/articles/213273-deploying-computing-environments).

## References

* [PyData 101 slides](https://speakerdeck.com/jakevdp/pydata-101)

by Jake VanderPlas

* [Tidy Data article](https://www.jstatsoft.org/article/view/v059i10/)

by Hadley Wickham

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/mkcor/exp-testing

Awesome Lists containing this project

README