Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/vermavinay982/tex-tractor

Searching Words in Live Video using OCR & Computer Vision
https://github.com/vermavinay982/tex-tractor

computer-vision hacktoberfest hacktoberfest2022 ocr-recognition python video

Last synced: 9 days ago
JSON representation

Searching Words in Live Video using OCR & Computer Vision

Awesome Lists containing this project

README

        

# TexTractor - Video text searching utility

Goal: Trying to find the specific text in a video
Motivation: While watching tutorial, need to find a specific word on screen, but had to watch the whole video. So thought to make it for the world.
Benefits: faster inference on videos, better analytics, threading for performance, multi processing

![Image](assets/gui_look.gif)
![Image](assets/Demo.gif)

it will give the frame, or timestamp where the text was present

for all words - timestamps will be generated
if the user searched few words - all those timestamps will be returned to user

- Find all the text in complete video, mark red the low confidence text, and black with confident text
- Allows you to find certain words, and get that slide created for you
- If you need the last page where the word was written, you can get that too by going through all pages
- Can summarize the whole video, and provide you the slides, super easy to read in emergency
- Works on your laptop, so no issue of privacy

## More Points
- search all at once
- search more
- getting range of video where text can be present - for faster search
- location of detected text also can be mentioned
- tracking of text can be done - for each frame - to reduce load on processing
- for each time stamp the text can be found out
- option to give frame skipping option to user is beneficial
- showing range where the word was present, some words are constantly present
- can highlight for each word by user where it was present
- close to it, could also be added - to make suggestion to user
- streamlit for deployment and params

Mini Goals
- find best text detection model
- infer it on single frames
- generate frames using cv2
- save data and present it in csv form
- use trees and other algorithms to merge the presense of word in all images