https://github.com/linkedinlearning/python-statistics-essential-training-4433355
This is a repository for the LinkedIn Learning course Python Statistics Essential Training
https://github.com/linkedinlearning/python-statistics-essential-training-4433355
Last synced: 9 months ago
JSON representation
This is a repository for the LinkedIn Learning course Python Statistics Essential Training
- Host: GitHub
- URL: https://github.com/linkedinlearning/python-statistics-essential-training-4433355
- Owner: LinkedInLearning
- License: other
- Created: 2023-06-12T15:18:20.000Z (almost 3 years ago)
- Default Branch: main
- Last Pushed: 2023-08-18T18:37:23.000Z (over 2 years ago)
- Last Synced: 2025-06-09T18:11:52.899Z (10 months ago)
- Language: Jupyter Notebook
- Size: 1.19 MB
- Stars: 45
- Watchers: 8
- Forks: 107
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
- Codeowners: .github/CODEOWNERS
Awesome Lists containing this project
README
# Python Statistics Essential Training
This is the repository for the LinkedIn Learning course Python Statistics Essential Training. The full course is available from [LinkedIn Learning][lil-course-url].
![Python Statistics Essential Training][lil-thumbnail-url]
The field of statistics has become increasingly dependent on data analysis and interpretation using Python. With the rise of big data and data science, the demand for professionals who can effectively analyze and interpret data using Python has skyrocketed. In this course, Matt Harrison teaches you how to collect, clean, analyze, and visualize data using the powerful tools of the Python programming language. Join Matt as he gives into the various techniques that form the backbone of statistics and helps you understand the data with summary statistics and visualizations. He explains how to create predictive models using both linear regression andXGBoost, and wraps up the course with a look at hypothesis testing.
If you’re interested in exploring statistics using a code-first approach, join Matt in this course as he shows you how to use Python to unlock the power of data.
## Instructions
You can run the notebook using:
* Codespaces
* Locally
## Running in GitHub Codespaces
You can use GitHub Codespaces to run this codebase in a fully configured, cloud-hosted development environment:
1. Click the green "Code" button on the main repository page.
2. Click on "Open with Codespaces" and then "New codespace."
3. Open up PyStats.ipynb
## Running Locally
To run this codebase locally, you'll need to set up a Python virtual environment and install the necessary packages. Follow these steps:
1. Ensure you have Python installed on your system. If not, you can download it from [here](https://www.python.org/downloads/).
2. Clone this repository into your local machine using the terminal (Mac), CMD (Windows), or a GUI tool like SourceTree.
3. Navigate to the repository folder via your terminal.
4. Create a virtual environment. You can do this by running:
```
python -m venv env
```
This will create a new Python virtual environment in a folder named `env`.
5. Activate the virtual environment. On macOS and Linux, run:
```
source env/bin/activate
```
On Windows, run:
```
.\env\Scripts\activate
```
6. Install the necessary packages using pip by running:
```
pip install -r requirements.txt
```
This will install all the packages listed in `requirements.txt` to your virtual environment.
7. Launch jupyter
```
jupyter lab
```
8. Open up PyStats.ipynb
## Solutions
The PyStats-Solutions.ipynb file has the solutions for the exercises.
### Instructor
Matt Harrison
Check out my other courses on [LinkedIn Learning](https://www.linkedin.com/learning/instructors/matt-harrison).
[lil-course-url]: https://www.linkedin.com/learning/python-statistics-essential-training-19258005?dApp=59033956&leis=LAA
[lil-thumbnail-url]: https://media.licdn.com/dms/image/D4D0DAQGb_695b92Eyw/learning-public-crop_288_512/0/1692131795957?e=2147483647&v=beta&t=3GG4GDOLjFGiviV1YZ22z12GRTj0dlptXTBjIPVnJZM