https://github.com/callum-op/video-pose-finder
An app that will return the poses of a person in a video, using a backend to clean the data for it to be used in external 3D modelling programs.
animation bvh-files css html javascript pose-tracking react tailwind vite
- Host: GitHub
- URL: https://github.com/callum-op/video-pose-finder
- Owner: Callum-OP
- Created: 2026-05-23T10:43:43.000Z (11 days ago)
- Default Branch: main
- Last Pushed: 2026-05-30T15:12:59.000Z (4 days ago)
- Last Synced: 2026-05-30T17:09:06.100Z (4 days ago)
- Topics: animation, bvh-files, css, html, javascript, pose-tracking, react, tailwind, vite
- Language: JavaScript
- Homepage: https://callum-op.github.io/Video-Pose-Finder/
- Size: 14.1 MB
- Stars: 0
- Watchers: 0
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
# Screenshots

# How to run
To try the online version, go to: https://callum-op.github.io/Video-Pose-Finder/
For the local version, install the dependencies first. Assuming Node.js is installed, run: npm install
Then start the development server with: npm run dev
It should be accessible locally at http://localhost:5173/
# Limitations
Tracking several people at the same time is unreliable, so this app tracks only one person at a time. Even then, in a video with several people it can easily get confused if they overlap with each other. Videos with only one person, or videos where one person can be seen clearly, are tracked much more reliably.
It struggles to tell when the character has turned around: it turns the limbs but keeps the hips still and facing forward, which twists the body in ways that are impossible. Similarly, a camera moving around the character causes the same issues, because so far the app assumes the character is facing the camera.
# Features To Consider
An advanced option could let the user move or place the skeleton where it should be, if that helps the tracker know where to start.
Add a warning message when tracking fails because the video contains multiple people.
A Python backend that uses Gemini AI to tell which way the people in the video are facing and how many people there are, which would address both of the app's current limitations at once.
A prompt for this could look like:
Prompt: "You are analysing a single frame from a motion capture video for a pose estimation pipeline. Your only task is to determine the orientation of the person's body relative to the camera. Analyse the frame and return ONLY a JSON object, no explanation:
{
"yaw_degrees": ,
"confidence": ,
"view": <"front" | "side_left" | "side_right" | "rear_3q_left" | "rear_3q_right" | "rear">,
"notes":
}
Classification boundaries:
front: |yaw| < 45°
side: 45° – 135°
rear_3q: 135° – 160°
rear: |yaw| > 160°
Do not describe the image. Return only the JSON object."
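Since the reply is meant to be machine-readable, the app could check it before trusting it. The following is a minimal sketch, not part of the current code: `classifyView` and `parseOrientation` are hypothetical names, the sign convention (positive yaw means the person turned to their left) and the -180 to 180 range are assumptions the prompt does not state, and the behaviour at exactly 135° is a choice made here because the listed ranges meet at that value.

```javascript
// Labels the prompt allows in the "view" field.
const VIEWS = [
  "front", "side_left", "side_right",
  "rear_3q_left", "rear_3q_right", "rear",
];

// Maps a yaw angle to a view label using the classification boundaries
// from the prompt. ASSUMPTION: positive yaw = turned to the person's left.
function classifyView(yawDegrees) {
  const magnitude = Math.abs(yawDegrees);
  const side = yawDegrees >= 0 ? "left" : "right";
  if (magnitude < 45) return "front";
  if (magnitude < 135) return `side_${side}`;
  if (magnitude <= 160) return `rear_3q_${side}`;
  return "rear";
}

// Parses the model's reply text. Returns null when the reply is not valid
// JSON or does not match the expected shape.
function parseOrientation(replyText) {
  let data;
  try {
    data = JSON.parse(replyText);
  } catch {
    return null;
  }
  if (data === null || typeof data !== "object") return null;
  const yaw = data.yaw_degrees;
  if (typeof yaw !== "number" || !Number.isFinite(yaw)) return null;
  if (Math.abs(yaw) > 180) return null; // ASSUMPTION: yaw is in [-180, 180]
  if (!VIEWS.includes(data.view)) return null;
  return {
    yawDegrees: yaw,
    view: data.view,
    // True when the label agrees with the angle under the rules above.
    consistent: classifyView(yaw) === data.view,
  };
}
```

A frame whose reply fails to parse, or whose `consistent` flag is false, could be skipped or re-requested rather than fed into the pose pipeline.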
Gemini AI could also be used to tidy up the pose data: filling in missing limbs, fixing inaccuracies, and doing other clean-up so that the final pose sequence is complete and usable.
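For comparison, short gaps can be filled without an AI model at all. The sketch below uses plain linear interpolation, which is a different and much simpler technique than the Gemini approach described above. It assumes a hypothetical data shape where each joint has a track of `{ x, y }` points, one per frame, with `null` where the joint was not detected; the app's real pose format may differ.

```javascript
// Fills null entries in one joint's track by interpolating linearly
// between the nearest known points on either side.
// Gaps at the very start or end are left as null, because there is
// only one neighbour to work from.
function fillGaps(track) {
  const filled = track.slice();
  let prev = -1; // index of the last frame where the joint was detected
  for (let i = 0; i < track.length; i++) {
    if (track[i] === null) continue;
    if (prev >= 0 && i - prev > 1) {
      const a = track[prev];
      const b = track[i];
      for (let j = prev + 1; j < i; j++) {
        const t = (j - prev) / (i - prev); // 0..1 position inside the gap
        filled[j] = {
          x: a.x + (b.x - a.x) * t,
          y: a.y + (b.y - a.y) * t,
        };
      }
    }
    prev = i;
  }
  return filled;
}
```

This only works well for brief dropouts. A limb that is hidden for many frames still needs something smarter, which is where a model-based clean-up could help.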
Save pose sequences, so the user can access them whenever they want. Saving settings would also be useful. These could be saved to local storage, or to a database such as MongoDB.
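The local storage option could look roughly like the sketch below. The function names and the storage key are hypothetical, and the functions take the storage object as a parameter so the same code works with `window.localStorage` in the browser or with a stub elsewhere.

```javascript
// ASSUMPTION: illustrative key name, not one the app currently uses.
const STORAGE_KEY = "video-pose-finder:sequences";

// Reads every saved sequence, falling back to an empty object when
// nothing is stored yet or the stored value is unreadable.
function loadAll(storage) {
  const raw = storage.getItem(STORAGE_KEY);
  if (raw === null) return {};
  try {
    const parsed = JSON.parse(raw);
    const isPlainObject =
      parsed !== null && typeof parsed === "object" && !Array.isArray(parsed);
    return isPlainObject ? parsed : {};
  } catch {
    return {};
  }
}

// Saves one named sequence alongside any existing ones.
function saveSequence(storage, name, frames) {
  const all = loadAll(storage);
  all[name] = { savedAt: new Date().toISOString(), frames };
  storage.setItem(STORAGE_KEY, JSON.stringify(all));
}

// Returns the frames for a saved sequence, or null if there is none.
function loadSequence(storage, name) {
  const entry = loadAll(storage)[name];
  return entry ? entry.frames : null;
}
```

In the browser this would be called as `saveSequence(window.localStorage, "walk", frames)`. Local storage is limited to a few megabytes in most browsers and `setItem` throws when the quota is exceeded, so long sequences may be a reason to prefer a database.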