Projects in Awesome Lists by PRITHIVSAKTHIUR
A curated list of projects in awesome lists by PRITHIVSAKTHIUR .
https://github.com/prithivsakthiur/how-to-run-huggingface-spaces-on-local-machine-demo
Running Hugging Face Spaces on a local machine / colab T4 GPU involves several steps. Hugging Face Spaces is a platform to host machine learning demos and applications using Streamlit, Gradio, or other frameworks.
Last synced: 02 Jul 2025
https://github.com/prithivsakthiur/stable-hamster
Fast as Hamster | Stable Hamster | Stable Diffusion
4k bot diffusion-models fast hamster hdr huggingface huggingface-transformers image image-generation modeling
Last synced: 16 Aug 2025
https://github.com/PRITHIVSAKTHIUR/FLUX-REALISM
Enhancing Face Realism / Epic Realism [ LoRA ]
black-forest-labs flux flux-dev flux-schnell gradio gradio-interface image lora realism text
Last synced: 16 Aug 2025
https://github.com/prithivsakthiur/video-background-remover
Remove Backgroud from a video π§€π₯
a100 background gpu gradio gradio-interface huggingface models nvidia opencv-python remover-tool video
Last synced: 06 May 2025
https://github.com/prithivsakthiur/gallo-3xl
High Quality Image Generation Model - Powered with NVIDIA A100
ai dall-e dalle2 dalle3 diffusers gradio huggingface image-generation peft peft-fine-tuning-llm text-to-image torch transformers
Last synced: 16 Aug 2025
https://github.com/prithivsakthiur/pdf-word-converter
PDF to Word converter is an easy-to-use, in-browser tool that allows you to quickly and securely export a PDF to a Microsoft Word file format (.doc) ...
app converter docx gradio huggingface pdf pdf-converter python to word
Last synced: 06 May 2025
https://github.com/prithivsakthiur/magic-eraser-tool
You don't have to worry about mastering photo editing techniques to remove an object from your photo. ** Simply mark over the areas you want to erase, and our AI will take care of the rest.
app docker eraser magic ml numpy python streamlit streamlit-webapp tensorflow
Last synced: 06 May 2025
https://github.com/prithivsakthiur/huggingface-android-application
URL to App Conversion
apk apktool app application drive go google html hugging-face java javascript kotlin median native procedural readme sdk shell steps url
Last synced: 11 Jul 2025
https://github.com/prithivsakthiur/imagineo-4k
Midjourney X Instant Collage -- Collage Template + Grid + Quality Style
collage computer-vision dalle dalle-3 diffusion-models fast gpu grid image image-generation image-generation-ai image-processing midjourney nvidia-gpu pytorch texttoimage zero
Last synced: 16 Aug 2025
https://github.com/prithivsakthiur/save-web-as-zip
Save any web url as zip ( image + assets + html + css + js )
beatifulsoup beautifulsoup4 huggingface spaces web webscraping webscraping-data website zip
Last synced: 06 May 2025
https://github.com/prithivsakthiur/face-swap-roop
Face-Swapper | Gradio Work Space | .hf.space
face gradio gradio-interface image image-image image-processing model python roop swap
Last synced: 16 Aug 2025
https://github.com/prithivsakthiur/hospital-management-system
Hospital Management System Using StreamLit Application
cloud community model python3 stream streamlit-webapp webapp
Last synced: 06 May 2025
https://github.com/prithivsakthiur/rag-pdf-chatbot
(PDF) Information and Inference, Retrieval-Augmented Generation [ RAG ]
llm packages-manager pdf-document-processor pdf-viewer streamlit
Last synced: 06 May 2025
https://github.com/prithivsakthiur/strangerx-multimodal-chatbot
Multimodal-ChatBot-App, LLM Model Id Doesn't work outside, since it is opeartable inside the Huggingface Models. Recommended to Deploy inside Huggingface Spaces SDK as StreamLit.
api chatbot chatbotapp chatgpt machine-learning mistral-7b model modelidentifire multimodal phi streamlit streamlit-webapp
Last synced: 28 Jul 2025
https://github.com/prithivsakthiur/ai-art-generator-sdxl
AUTOMATIC1111: Software for tensor operations, saving tensor data in .safetensors format. ComfyUI: UI library, possibly managing tensor data safely with *.safetensors. InvokeAI: ML platform using *.safetensors for secure tensor storage.
api art gradio gradio-custom-component gradio-interface gradio-python-llm identity llm machine-learning model modeling sdxl
Last synced: 19 Oct 2025
https://github.com/prithivsakthiur/prompt-extender-gradio
The "Prompt Extender" is a tool or concept that can be applied in various fields, from artificial intelligence to creative writing. Its primary function is to expand or elaborate on a given prompt or idea, providing additional context, details, or possibilities.
Last synced: 06 May 2025
https://github.com/prithivsakthiur/image-captioning-salesforce-blip
The BlipProcessor and BlipForConditionalGeneration are likely classes specific to a model called "Blip," which seems to be a transformer-based model for conditional text generation.
captioning-images image-processing
Last synced: 06 Jul 2025
https://github.com/prithivsakthiur/medical-map-streamlit
Medical Map - Hospital Location & Details πΊοΈποΈ
google hospital-management map mapping medicalmap
Last synced: 29 Jun 2025
https://github.com/prithivsakthiur/strangerai
Turning Ideas to Product - StrangerAI - StrangerZone. Recommended to Deploy inside Huggingface Spaces SDK as GRADIO
api chat-application chatbot chatgpt llm-inference open-source openai openapi
Last synced: 16 Aug 2025
https://github.com/prithivsakthiur/airbnb-nyc-maps
Airbnb Price in NYC ( Select Boroughs )
airbnb classifiers gradio huggingface map maps nyc plotly point price price-tracker pyplot range space
Last synced: 28 Jun 2025
https://github.com/prithivsakthiur/imagegpt-4xl
Image Knowledge base / Idea / Problem Solve / Analysis
gradio huggingface huggingface-transformers image instructor microsoft phi python question-answering
Last synced: 06 May 2025
https://github.com/prithivsakthiur/plant-diseases-cnn
Model trained for educational purposes only; usage subject to terms and conditions.
analysis analysis-algorithms cnn disease-prediction git github-config gradio h5 lfs model plant training
Last synced: 06 May 2025
https://github.com/prithivsakthiur/vision-inference
What Happen Next ? Live Inference
css docker html inference-engine javascript live model
Last synced: 06 May 2025
https://github.com/prithivsakthiur/1000-general-knowledge-flashcards
1000 Flashcards ( General, Sports, Technical,Space ) ππ
1000 flashcards general gradio hugging-face knowledge python questions quiz spaces
Last synced: 06 May 2025
https://github.com/prithivsakthiur/plant-status-gradio
πUpload a photo of a plant to see how the model classifies its status!π
classifier disease-prediction gradio h5 mbs model plants
Last synced: 06 May 2025
https://github.com/prithivsakthiur/web-data-scraper
Data text successfully scraped! - Put & Get
Last synced: 06 May 2025
https://github.com/prithivsakthiur/youtube-video-downloader
Enter YouTube link π To Download Videoβ¬οΈ
Last synced: 06 May 2025
https://github.com/prithivsakthiur/bert-uncased
BERT is a transformers model pretrained on a large corpus of English data in a self-supervised fashion. This means it was pretrained on the raw texts only, with no humans labeling them in any way (which is why it can use lots of publicly available data) with an automatic process to generate inputs and labels from those texts.
bert betelgeuse google llm model package text text-classification transformer
Last synced: 06 May 2025
https://github.com/prithivsakthiur/medical-term-article-search
HealthCare-Informatics-MediSearch
api client information-retrieval json jsonwebtoken machine-learning medical medisearch python3 records results solutions streamlit streamlit-webapp webapp
Last synced: 16 Aug 2025
https://github.com/prithivsakthiur/text-tokenizer-playground
Text Tokenizer Playground ( Transformers.js ) SDK in Hugginface.
css html js playground text tokenization tokenization-client
Last synced: 06 May 2025
https://github.com/prithivsakthiur/top-prompt-collection
A prompt library is a collection of image prompts that can be used to generate new and interesting image.
collections css html huggingface list prompt queue static-site webpage website
Last synced: 02 Sep 2025
https://github.com/prithivsakthiur/bidirectional-and-auto-regressive-transformer-cnn
BARTβs primary task is used to generate clean semantically coherent text from corrupted text data but it can also be used for a variety of different NLP sub-tasks like language translation, question-answering tasks, text summarization, paraphrasing, etc.
Last synced: 06 May 2025
https://github.com/prithivsakthiur/master-gpt
Chat, Web, Media, Image GPT
api chat-application chatbot chatgpt demo fast huggingface image imagegpt mode muti-process search searchenginemarketing web zip
Last synced: 06 May 2025
https://github.com/prithivsakthiur/chatbot-gpt
3-In-1-Chatbot - GPT
ai app chat-application chatbot chatgpt llm mistral multimodal streamlit-webapp
Last synced: 06 May 2025
https://github.com/prithivsakthiur/all-in-one-downloader
yt-dlp is a feature-rich command-line audio/video downloader with support for thousands of sites. The project is a fork of youtube-dl based on the now inactive youtube-dlc.
downloader gradio huggingface instagram pkt python reddit twitter video workspace x youtube yt-dlp
Last synced: 06 May 2025
https://github.com/prithivsakthiur/strangerzone-chatbot
Turning Ideas to Product - StrangerAI - StrangerZone - Zero Data Collection from User.
Last synced: 16 Aug 2025
https://github.com/prithivsakthiur/canopus-realism
Realistic Image Generation, Realistic trigger works properly, better for photorealistic trigger words, close-up shots, face diffusion, male, female characters.
computer-vision gradio lora model realism realistic sdxl stable-diffusion text2image
Last synced: 16 Aug 2025
https://github.com/prithivsakthiur/orpheus-tts-edge
Play with Orpheus TTS, a Llama-based Speech-LLM designed for high-quality, empathetic text-to-speech generation. This model has been fine-tuned to deliver human-level speech synthesis π₯π£οΈ
gradio gradio-python-llm huggingface-transformers llama llm orpheus tts
Last synced: 12 Jul 2025
https://github.com/prithivsakthiur/stablediffusion
Continuous progress in AI research leads to the development of more robust algorithms, models, and techniques, making AI solutions more effective and reliable.
ai api apiurlrequest app generation genetic-algorithm gradio huggingface image llama llm python sdxl stability-ai stable-diffusion transformer
Last synced: 16 Aug 2025
https://github.com/prithivsakthiur/stable-wallpapers
Demo space for generating, Desktop / Mobile Wallpapers. 16:9 / 9:16 #Dream Wallpaper by Stable Wallpaper [ stable diffusion xl ]
diffusion-models gradio huggingface huggingface-transformers lightning models stable-diffusion texttoimage transformer wallpapers
Last synced: 06 May 2025
https://github.com/prithivsakthiur/client-record-curd-ops-exercise
Client Record Management - CURD OPs + Blazor Web Assembly with Standalone App
blazor client csharp curd-operation dotnet dotnet-core entity-framework mangement microsoftsqlserver records webapi
Last synced: 09 Jul 2025
https://github.com/prithivsakthiur/gen-vision
Multiple Conditioned Image Generation, SDXL, Low-rank adaptation Refined
colab-notebook cookbooks diffusers gen gradio huggingface imagegeneration lora spaces stable-diffusion texttoimage transformer
Last synced: 16 Aug 2025
https://github.com/prithivsakthiur/yolox-t4
Ultralytics, YOLO v8 - Computer Vision
computer-vision engine inference ultralytics video yolo yolov8
Last synced: 05 Oct 2025
https://github.com/prithivsakthiur/yolo-nms-captioning
Object Detection - Captioning ( yolo8n & blip-image-captioning-large )
Last synced: 24 Oct 2025
https://github.com/prithivsakthiur/yolox-cpu
Ultralytics, YOLO v8 - Computer Vision
counting heatmap-visualization inference object object-detection speedestimation tensorflow ultralytics video videointelligence workout-tracker yolo yolov8
Last synced: 11 Oct 2025
https://github.com/prithivsakthiur/text-to-image
Text to Image Gen [ Demo ]
a100 ai colab-notebook demo face genai hugging huggingface image image-generation image-processing ipython-notebook jupyter-notebook t4 text transformer
Last synced: 29 Jul 2025
https://github.com/prithivsakthiur/llama-3.2-1b-gguf-fast-gradio
GPT-Generated Unified Format
Last synced: 16 Jul 2025
https://github.com/prithivsakthiur/data-board
Data Boards - Visualization of various plots ( Analysis )
data-analysis gradio huggingface keras mathplotlib pandas plots pyplot scikit-learn seaborn spaces
Last synced: 28 Oct 2025
https://github.com/prithivsakthiur/wiki-scrap
Wikipedia Scrapping
bs4 content gradio gradio-interface image requests scrapper scrapping sdk-python text wikipedia
Last synced: 09 Sep 2025
https://github.com/prithivsakthiur/prompt-extender-model
The "Prompt Extender" is a tool or concept that can be applied in various fields, from artificial intelligence to creative writing. Its primary function is to expand or elaborate on a given prompt or idea, providing additional context, details, or possibilities.
extend hugging huggingface huggingface-transformers machine-learning models prompt text
Last synced: 04 Apr 2025
https://github.com/prithivsakthiur/imgscope-ocr-2b-0527
The Imgscope-OCR-2B-0527 model is a fine-tuned version of Qwen2-VL-2B-Instruct, specifically optimized for messy handwriting recognition, document OCR, realistic handwritten OCR, and math problem solving with LaTeX formatting. This model is trained on custom datasets for document and handwriting OCR tasks and textual understanding
caption gradio huggingface-transformers llms ocr ollama-gui py qwen2-vl video-processing vlm
Last synced: 08 Aug 2025
https://github.com/prithivsakthiur/agent-dino
Dino: The Minimalist Multipurpose Chat System
agent deepseek-r1 demo diffusion-models huggingface-spaces llm object-detection text-to-image vision-language-model yolo
Last synced: 29 Jul 2025
https://github.com/prithivsakthiur/type-bytes
Dynamic Typed Videos with 'Type Byte'
blog byte clipvideo color frame framework gradio gradio-interface huggingface moviepy pillow python text text-to-video textwrap video
Last synced: 19 Nov 2025
https://github.com/prithivsakthiur/grab-doc-v
MS Word Like Content Creation System
app cdn chatbot chatgpt client docx fpdf grab gradio huggingface llama llama2 llama3 minimalist openai python python-docx scrap
Last synced: 08 Dec 2025
https://github.com/prithivsakthiur/vlm-parsing
VLM-Parsing is a Gradio-based web application for parsing documents and images into structured HTML and Markdown formats using advanced Vision Language Models (VLMs).
cuda gradio html huggingface-models huggingface-spaces huggingface-transformers logics markdown ocr-recognition pytorch qwen2-5-vl spaces vlm
Last synced: 10 Oct 2025
https://github.com/prithivsakthiur/grab-doc
Chat Response Documentation
documentation-tool docx-generator fpdf gradio hugging-face huggingface-hub mistral-7b pdf-generation python-docx text-mining txt-files
Last synced: 24 Oct 2025
https://github.com/prithivsakthiur/nano-banana-aio
Nano Banana AIO is a web application built with React and the Google Gemini API for image generation and editing. It provides an all-in-one interface for creating, editing, and manipulating images using AI-powered tools.
css docker docker-image gemini-flash google-cloud-platform html image-editor javascript nano-banana typescript
Last synced: 08 Nov 2025
https://github.com/prithivsakthiur/multimodal-ocr
OCR Vision Language Model
multimodal-large-language-models ocr qwen2-vl vision-transformer vlm
Last synced: 02 Mar 2025
https://github.com/prithivsakthiur/bellatrix-tiny3-1b-webgpu
webgpu based llm chatbot, try on chrome browsers
chatbot huggingface-transformers llama llm ollama onnx onnx-models onnxruntime webgpu
Last synced: 17 Mar 2025
https://github.com/prithivsakthiur/deep-fake-detector-model
Vision Transformer (ViT)
deep-fake detection google image-classification transformer vision-transformer
Last synced: 02 Mar 2025
https://github.com/prithivsakthiur/flux-image-captioner
FLUX.1-dev with Qwen2VL Captioner and Prompt Enhancer
captioning-images flux google huggingface-transformers json llm-inference qwen2vl t5-small
Last synced: 10 Oct 2025
https://github.com/prithivsakthiur/grid-6x
Layout for Seamless Image Assembly
epic-realism gradio grid lora python realism sdxl stable-diffusion turbo turbo-realism
Last synced: 29 Mar 2025
https://github.com/prithivsakthiur/qwen-image-edit-2509-loras-fast
Qwen-Image-Edit-2509-LoRAs-Fast is a high-performance, user-friendly web application built with Gradio that leverages the advanced Qwen/Qwen-Image-Edit-2509 model from Hugging Face for seamless image editing tasks.
aoti diffusers diffusion-models flash-attention-3 huggingface-spaces huggingface-transformers kernel numpy peft python pytorch qwen-image-edit qwen-image-edit-2509 qwen2-5-vl qwen3-vl torch torchvision
Last synced: 18 Nov 2025
https://github.com/prithivsakthiur/spam-or-ham
Spam Detection Using Bert
bert google single-shot-detection spam-detection text-classification uncased
Last synced: 13 Mar 2025
https://github.com/prithivsakthiur/flux-sketch-smudge-3to1
3:1 Best Image Gen
Last synced: 24 Dec 2025
https://github.com/prithivsakthiur/qwen2-vl
Qwen2VL Models: Vision and Language Processing . FT; [ Latex OCR, Math Parsing, Text Analogy OCRTest ]
Last synced: 24 Dec 2025
https://github.com/prithivsakthiur/triangulum
Triangulum 10B: Multilingual Large Language Models (LLMs)
10b 1b 5b llama-cpp llama-cpp-python llm ollama text-generation
Last synced: 22 Feb 2025
https://github.com/prithivsakthiur/convert-to-onnx-hf-dir
Convert a Hugging Face model to ONNX & Upload Directly to Your Hf Model Repo
conversion huggingface huggingface-transformers onnx onnxruntime optimum streamlit
Last synced: 21 Mar 2025
https://github.com/prithivsakthiur/auto-abliteration
modify a language model's behavior by abliterating its weights.
abliteration gemma3 huggingface-transformers llm llms ollama streamlit uncensored
Last synced: 21 Mar 2025
https://github.com/prithivsakthiur/codepy-deepthink-3b
step-by-step solutions, creative content, and logical analyses
codepy coder cot deep-learning deep-think llama lmstudio ollama reasoning
Last synced: 09 Nov 2025
https://github.com/prithivsakthiur/smart-search
One shot search ...........
algorithms docker prompt-engineering search-engine
Last synced: 04 Apr 2025
https://github.com/prithivsakthiur/fire-detection-siglip2
Fire-Detection-Siglip2 is an image classification vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for a single-label classification task. It is designed to detect fire, smoke, or normal conditions using the SiglipForImageClassification architecture.
fire-detection google huggingface huggingface-transformers image-classification llama normal siglip siglip2 smoke vit
Last synced: 01 Apr 2025
https://github.com/prithivsakthiur/trash-net
Trash-Net is an image classification vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for a single-label classification task. It is designed to classify images of waste materials into different categories using the SiglipForImageClassification architecture
classification google gradio huggingface-transformers image-classification image-processing pillow python torch trash vision-transformer
Last synced: 09 Apr 2025
https://github.com/prithivsakthiur/human-action-recognition
Human-Action-Recognition is an image classification vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for multi-class human action recognition. It uses the SiglipForImageClassification architecture to predict human activities from still images.
action huggingface-transformers human recognition siglip2
Last synced: 05 Sep 2025
https://github.com/prithivsakthiur/clipart-126-domainnet
Clipart-126-DomainNet is an image classification vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for a single-label classification task. It is designed to classify clipart images into 126 domain categories using the SiglipForImageClassification architecture
art classification demo-app gradio huggingface-spaces huggingface-transformers image-classification llama siglip2 torchvision vision-transformer
Last synced: 26 Mar 2025
https://github.com/prithivsakthiur/website-code-scraper-ec2-testing
Content based site code scraping. CDN Content Delivery Network x BS4
analysis-framework bs4 bs4-requests cdn coding ec2 gradio-interface langchain webscraping
Last synced: 04 Apr 2025
https://github.com/prithivsakthiur/omni-reasoner-vision
Omni Reasoner for Visionn
Last synced: 27 Dec 2025
https://github.com/prithivsakthiur/imagineo-chat-video-understanding
multimodal chatbot
cv2 gemma3 google gradio huggingface-transformers opencv video-processing
Last synced: 22 Jun 2025