An open API service indexing awesome lists of open source software.

https://github.com/ejw-data/python-sentences

Simple script extracting paragraphs, sentences, words, and characters by using regular expressions and simple data structures.
https://github.com/ejw-data/python-sentences

python regular-expressions

Last synced: over 1 year ago
JSON representation

Simple script extracting paragraphs, sentences, words, and characters by using regular expressions and simple data structures.

Awesome Lists containing this project

README

          

# python-sentences

Auhtor: Erin James Wills, ejw.data@gmail.com

![Sentences](./images/py-sentences.png)

## Overview


There is not much to this script except that I was looking at analyzing some text documents and was practicing using `regular expressions` and extracting the data into different data structures.

**`Note`**: I did not pursue this in length because I do not regularly utilize Natural Language Processing (NLP) in my current project loads; although, I do find some applications interesting like analyzing social media and comments section responses and news media transcripts. I have one personal project in the queue to analyze MSNBC, FOX, and CNN transcripts over different time periods and scales to see the patterns and trends.

## Technologies
* Python


## Data Source
Dataset generated by Trilogy Education Services. Origins beyond this is unknown.


## Setup and Installation
1. Environment needs the following:
* Python 3.6+
* os
* csv
* re
* time
1. Activate your environment
1. Clone the repo to your local machine
1. Navigate the terminal to the repo folder
1. In the terminal, run `python sentences.py`