Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/varungitgood/cinematch

expressjs flask-api nlp-machine-learning python3 reactjs tailwindcss

Last synced: 21 days ago
JSON representation

Host: GitHub
URL: https://github.com/varungitgood/cinematch
Owner: VarunGitGood
Created: 2023-01-21T09:14:13.000Z (almost 2 years ago)
Default Branch: main
Last Pushed: 2023-01-25T10:54:41.000Z (almost 2 years ago)
Last Synced: 2024-11-01T09:11:57.630Z (2 months ago)
Topics: expressjs, flask-api, nlp-machine-learning, python3, reactjs, tailwindcss
Language: Jupyter Notebook
Homepage:
Size: 9.78 MB
Stars: 1
Watchers: 1
Forks: 1
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# Cinematch
Cinematch is a movie/series recommendation platform uses a cutting-edge machine learning model to personalize movie suggestions based on your interests.
### Features

* It uses ML model to recommend moviesand web series based on your preference.
* Allow you to create multiple watchlists according to your mood
* Option to share your watchlists with friends
* Provides related movie/series suggestions based on recent films/series you've watched
* Regularly updated library of films and series to explore
* Discover new movies and series with personalized recommendations tailored to your preferences.
## Tech Architecture
![Frame 1](https://user-images.githubusercontent.com/68912239/213908685-b77be43e-fe10-49c4-8438-aa963711e4c7.png)

### Data Scrapping
* This code is used for scraping movie data from the IMDb website. It uses the Python library requests to send GET requests to the IMDb search page, and the library BeautifulSoup to parse the HTML returned by the requests.
* The script starts by setting the maximum number of pages to scrape (max_tiles) and the base URL for the IMDb search page. It then iterates through a range of page numbers (incrementing by 50 each time) and sends a GET request to the URL with the current page number appended to it.

* The HTML returned by the request is parsed using BeautifulSoup, and the script looks for specific elements on the page that contain the information we want (such as movie title, year, genre, cast, etc.). The script then retrieves this information and writes it to a csv file, with each row representing a different movie.

* The script also uses a try-except block to handle any errors that may occur when scraping the data, such as if a specific element is not found on a page.

### Express Backend
* Centralized backend to manage all the microservices.
* Routes for auth, follow, unfollow, create llist, get list, profile etc.
* database connection
### Recommendation Model
The process of proposing movies to consumers is called movie recommendation,
and it is based on their watching tastes and history.
Utilizing cosine similarity is one method of doing this.

* In our Model, We are using CountVectorizer to create a matrix of token counts from a collection of text documents for a movie recommendation. The similarity between the films depending on their genre is then determined using this matrix as input to a similarity measure like cosine similarity. The higher the similarity value, the more similar the movies are considered to be. CountVectorizer is used to create the matrix of token counts by tokenizing the text and counting the number of occurrencesof each token in each document. This allows for a more accurate representation of the text data and more accurate similarity calculations.
* The cosine of the angle between any two non-zero vectors in an inner product space is used to quantify how similar the vectors are. The calculation of text or document similarity is frequently employed in information retrieval andnatural language processing. Cosine similarity may be used to compare moviesbased on its properties, such as genre, narrative, and cas, and to determine how similar various viewers' choices are to one another.

## Run Locally

Clone the project

```bash
git clone https://link-to-project
```

### Start the Express server

Go to the project directory

```bash
cd Cinematch/Backend
```

Install dependencies

```bash
npm install
```

Start the server

```bash
npm run dev
```

### Start the Flask server

Go to the project directory

```bash
cd CineMatch/FlaskApi
```

Install dependencies

```bash
pip install requirements.txt
```

Start the server

```bash
python app.py
```

### Start the vite server

Go to the project directory

```bash
cd CineMatch/Frontend
```

Install dependencies

```bash
npm install
```

Start the server

```bash
npm run dev
```