https://github.com/ejw-data/python-sentences
Simple script extracting paragraphs, sentences, words, and characters by using regular expressions and simple data structures.
python regular-expressions
- Host: GitHub
- URL: https://github.com/ejw-data/python-sentences
- Owner: ejw-data
- Created: 2020-06-28T06:06:34.000Z (about 6 years ago)
- Default Branch: master
- Last Pushed: 2022-06-27T13:51:49.000Z (about 4 years ago)
- Last Synced: 2025-01-22T06:47:19.363Z (over 1 year ago)
- Topics: python, regular-expressions
- Language: Python
- Homepage:
- Size: 124 KB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# python-sentences
Author: Erin James Wills, ejw.data@gmail.com

## Overview
This is a small practice script. I was looking at analyzing some text documents and used it to practice `regular expressions` and to extract the results into different data structures.
**`Note`**: I did not pursue this at length because Natural Language Processing (NLP) is not a regular part of my current project load. I do find some applications interesting, though, such as analyzing social media posts, comment-section responses, and news media transcripts. I have one personal project in the queue to analyze MSNBC, FOX, and CNN transcripts over different time periods and scales to look for patterns and trends.
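
As a rough illustration of the kind of extraction described above, here is a minimal sketch. It is not the code in `sentences.py`: the function names, the regular expressions, and the sample text are all assumptions made for this example, and the sentence splitter is deliberately naive (abbreviations such as "Dr." would be split incorrectly).

```python
import re
from collections import Counter


def split_paragraphs(text):
    """Split text into paragraphs on one or more blank lines."""
    return [p.strip() for p in re.split(r"\n\s*\n", text) if p.strip()]


def split_sentences(paragraph):
    """Split after '.', '!' or '?' when followed by whitespace (naive)."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", paragraph) if s.strip()]


def split_words(sentence):
    """Return runs of letters, digits, and apostrophes as words."""
    return re.findall(r"[A-Za-z0-9']+", sentence)


def summarize(text):
    """Collect counts into a dict and word frequencies into a Counter."""
    paragraphs = split_paragraphs(text)
    sentences = [s for p in paragraphs for s in split_sentences(p)]
    words = [w for s in sentences for w in split_words(s)]
    return {
        "paragraphs": len(paragraphs),
        "sentences": len(sentences),
        "words": len(words),
        "characters": sum(len(w) for w in words),  # characters inside words only
        "avg_words_per_sentence": len(words) / len(sentences) if sentences else 0.0,
        "word_counts": Counter(w.lower() for w in words),
    }


# Hypothetical sample text, not taken from the project's dataset.
sample = (
    "Regular expressions are handy. They split text quickly!\n\n"
    "Is this a second paragraph? Yes, it is."
)
stats = summarize(sample)
print(stats["paragraphs"], stats["sentences"], stats["words"])
print(stats["word_counts"].most_common(1))
```

Lists hold the ordered pieces (paragraphs, sentences, words), a dict groups the summary values, and a `Counter` tracks word frequencies. Other structures would work equally well.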
## Technologies
* Python
## Data Source
Dataset generated by Trilogy Education Services. Its origin beyond that is unknown.
## Setup and Installation
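1. The environment needs the following (`os`, `csv`, `re`, and `time` are part of the Python standard library, so nothing beyond Python itself has to be installed):
    * Python 3.6+
    * os
    * csv
    * re
    * time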
1. Activate your environment
1. Clone the repo to your local machine
1. In the terminal, navigate to the repo folder
1. In the terminal, run `python sentences.py`
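
The requirements listed in step 1 can be checked before running the script. The following is a minimal sketch. `check_environment` and `REQUIRED_MODULES` are hypothetical names introduced for this example and are not part of the repo.

```python
import importlib
import sys

# Modules listed in the setup steps; all ship with the standard library.
REQUIRED_MODULES = ("os", "csv", "re", "time")


def check_environment(min_version=(3, 6)):
    """Return a list of problems; an empty list means the environment looks fine."""
    problems = []
    if sys.version_info < min_version:
        problems.append(
            "Python {}.{}+ is required, found {}.{}".format(
                min_version[0], min_version[1],
                sys.version_info[0], sys.version_info[1],
            )
        )
    for name in REQUIRED_MODULES:
        try:
            importlib.import_module(name)
        except ImportError:
            problems.append("missing module: {}".format(name))
    return problems


problems = check_environment()
print("Environment OK" if not problems else "\n".join(problems))
```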