Projects in Awesome Lists by PRITHIVSAKTHIUR

https://github.com/prithivsakthiur/how-to-run-huggingface-spaces-on-local-machine-demo

Running Hugging Face Spaces on a local machine or a Colab T4 GPU involves several steps. Hugging Face Spaces is a platform for hosting machine learning demos and applications built with Streamlit, Gradio, or other frameworks.

demo huggingface spaces

Last synced: 02 Jul 2025

https://github.com/prithivsakthiur/stable-hamster

Fast as Hamster | Stable Hamster | Stable Diffusion

4k bot diffusion-models fast hamster hdr huggingface huggingface-transformers image image-generation modeling

Last synced: 16 Aug 2025

https://github.com/PRITHIVSAKTHIUR/FLUX-REALISM

Enhancing Face Realism / Epic Realism [ LoRA ]

black-forest-labs flux flux-dev flux-schnell gradio gradio-interface image lora realism text

Last synced: 16 Aug 2025

https://github.com/prithivsakthiur/video-background-remover

Remove Background from a video 🧤🎥

a100 background gpu gradio gradio-interface huggingface models nvidia opencv-python remover-tool video

Last synced: 06 May 2025

https://github.com/prithivsakthiur/gallo-3xl

High Quality Image Generation Model - Powered with NVIDIA A100

ai dall-e dalle2 dalle3 diffusers gradio huggingface image-generation peft peft-fine-tuning-llm text-to-image torch transformers

Last synced: 16 Aug 2025

https://github.com/prithivsakthiur/pdf-word-converter

PDF to Word converter is an easy-to-use, in-browser tool that allows you to quickly and securely export a PDF to a Microsoft Word file format (.doc) ...

app converter docx gradio huggingface pdf pdf-converter python to word

Last synced: 06 May 2025

https://github.com/prithivsakthiur/magic-eraser-tool

You don't have to worry about mastering photo editing techniques to remove an object from your photo. Simply mark over the areas you want to erase, and our AI will take care of the rest.

app docker eraser magic ml numpy python streamlit streamlit-webapp tensorflow

Last synced: 06 May 2025

https://github.com/prithivsakthiur/huggingface-android-application

URL to App Conversion

apk apktool app application drive go google html hugging-face java javascript kotlin median native procedural readme sdk shell steps url

Last synced: 11 Jul 2025

https://github.com/prithivsakthiur/imagineo-4k

Midjourney X Instant Collage -- Collage Template + Grid + Quality Style

collage computer-vision dalle dalle-3 diffusion-models fast gpu grid image image-generation image-generation-ai image-processing midjourney nvidia-gpu pytorch texttoimage zero

Last synced: 16 Aug 2025

https://github.com/prithivsakthiur/save-web-as-zip

Save any web url as zip ( image + assets + html + css + js )

beatifulsoup beautifulsoup4 huggingface spaces web webscraping webscraping-data website zip

Last synced: 06 May 2025

https://github.com/prithivsakthiur/face-swap-roop

Face-Swapper | Gradio Work Space | .hf.space

face gradio gradio-interface image image-image image-processing model python roop swap

Last synced: 16 Aug 2025

https://github.com/prithivsakthiur/hospital-management-system

Hospital Management System built as a Streamlit application

cloud community model python3 stream streamlit-webapp webapp

Last synced: 06 May 2025

https://github.com/prithivsakthiur/picture-speaks

i-Talk, you-Talk, we-Talk

audio gradio image models picture text

Last synced: 06 May 2025

https://github.com/prithivsakthiur/rag-pdf-chatbot

(PDF) Information and Inference, Retrieval-Augmented Generation [ RAG ]

llm packages-manager pdf-document-processor pdf-viewer streamlit

Last synced: 06 May 2025

https://github.com/prithivsakthiur/strangerx-multimodal-chatbot

Multimodal-ChatBot-App. The LLM model ID doesn't work outside Hugging Face, since it is operable only within Hugging Face Models. Recommended to deploy in Hugging Face Spaces using the Streamlit SDK.

api chatbot chatbotapp chatgpt machine-learning mistral-7b model modelidentifire multimodal phi streamlit streamlit-webapp

Last synced: 28 Jul 2025

https://github.com/prithivsakthiur/rabbitr1-strom-chatbot

Chatbot System

chatbot gpt llm mistral-7b mistralai model text tts tts-api voice

Last synced: 16 Aug 2025

https://github.com/prithivsakthiur/ai-art-generator-sdxl

AUTOMATIC1111: Software for tensor operations, saving tensor data in .safetensors format. ComfyUI: UI library, possibly managing tensor data safely with *.safetensors. InvokeAI: ML platform using *.safetensors for secure tensor storage.

api art gradio gradio-custom-component gradio-interface gradio-python-llm identity llm machine-learning model modeling sdxl

Last synced: 19 Oct 2025

https://github.com/prithivsakthiur/video-face-swapper

Model trained for educational purposes only; usage subject to terms and conditions. 📟

deepfakes face model roop swap

Last synced: 06 May 2025

https://github.com/prithivsakthiur/prompt-extender-gradio

The "Prompt Extender" is a tool or concept that can be applied in various fields, from artificial intelligence to creative writing. Its primary function is to expand or elaborate on a given prompt or idea, providing additional context, details, or possibilities.

extention prompt

Last synced: 06 May 2025

https://github.com/prithivsakthiur/image-captioning-salesforce-blip

BlipProcessor and BlipForConditionalGeneration are Hugging Face Transformers classes for BLIP, a transformer-based vision-language model used for conditional text generation such as image captioning.

captioning-images image-processing

Last synced: 06 Jul 2025

https://github.com/prithivsakthiur/medical-map-streamlit

Medical Map - Hospital Location & Details 🗺️🗞️

google hospital-management map mapping medicalmap

Last synced: 29 Jun 2025

https://github.com/prithivsakthiur/strangerai

Turning Ideas to Product - StrangerAI - StrangerZone. Recommended to Deploy inside Huggingface Spaces SDK as GRADIO

api chat-application chatbot chatgpt llm-inference open-source openai openapi

Last synced: 16 Aug 2025

https://github.com/prithivsakthiur/airbnb-nyc-maps

Airbnb Price in NYC ( Select Boroughs )

airbnb classifiers gradio huggingface map maps nyc plotly point price price-tracker pyplot range space

Last synced: 28 Jun 2025

https://github.com/prithivsakthiur/imagegpt-4xl

Image Knowledge base / Idea / Problem Solve / Analysis

gradio huggingface huggingface-transformers image instructor microsoft phi python question-answering

Last synced: 06 May 2025

https://github.com/prithivsakthiur/plant-diseases-cnn

Model trained for educational purposes only; usage subject to terms and conditions.

analysis analysis-algorithms cnn disease-prediction git github-config gradio h5 lfs model plant training

Last synced: 06 May 2025

https://github.com/prithivsakthiur/vision-inference

What Happens Next? Live Inference

css docker html inference-engine javascript live model

Last synced: 06 May 2025

https://github.com/prithivsakthiur/1000-general-knowledge-flashcards

1000 Flashcards ( General, Sports, Technical, Space ) 📔📔

1000 flashcards general gradio hugging-face knowledge python questions quiz spaces

Last synced: 06 May 2025

https://github.com/prithivsakthiur/plant-status-gradio

🍃Upload a photo of a plant to see how the model classifies its status!🍃

classifier disease-prediction gradio h5 mbs model plants

Last synced: 06 May 2025

https://github.com/prithivsakthiur/web-data-scraper

Data text successfully scraped! - Put & Get

data scraper streamlit web

Last synced: 06 May 2025

https://github.com/prithivsakthiur/youtube-video-downloader

Enter YouTube link 🔗 To Download Video⬇️

downloader video youtube

Last synced: 06 May 2025

https://github.com/prithivsakthiur/bert-uncased

BERT is a transformers model pretrained on a large corpus of English data in a self-supervised fashion. This means it was pretrained on the raw texts only, with no humans labeling them in any way (which is why it can use lots of publicly available data) with an automatic process to generate inputs and labels from those texts.

bert betelgeuse google llm model package text text-classification transformer

Last synced: 06 May 2025

https://github.com/prithivsakthiur/ehrm-demo

Last synced: 01 Aug 2025

https://github.com/prithivsakthiur/medical-term-article-search

HealthCare-Informatics-MediSearch

api client information-retrieval json jsonwebtoken machine-learning medical medisearch python3 records results solutions streamlit streamlit-webapp webapp

Last synced: 16 Aug 2025

https://github.com/prithivsakthiur/text-tokenizer-playground

Text Tokenizer Playground ( Transformers.js ) SDK in Hugging Face.

css html js playground text tokenization tokenization-client

Last synced: 06 May 2025

https://github.com/prithivsakthiur/top-prompt-collection

A prompt library is a collection of image prompts that can be used to generate new and interesting images.

collections css html huggingface list prompt queue static-site webpage website

Last synced: 02 Sep 2025

https://github.com/prithivsakthiur/bidirectional-and-auto-regressive-transformer-cnn

BART’s primary task is to generate clean, semantically coherent text from corrupted text data, but it can also be used for a variety of other NLP sub-tasks such as language translation, question answering, text summarization, and paraphrasing.

Last synced: 06 May 2025

https://github.com/prithivsakthiur/master-gpt

Chat, Web, Media, Image GPT

api chat-application chatbot chatgpt demo fast huggingface image imagegpt mode muti-process search searchenginemarketing web zip

Last synced: 06 May 2025

https://github.com/prithivsakthiur/chatbot-gpt

3-In-1-Chatbot - GPT

ai app chat-application chatbot chatgpt llm mistral multimodal streamlit-webapp

Last synced: 06 May 2025

https://github.com/prithivsakthiur/all-in-one-downloader

yt-dlp is a feature-rich command-line audio/video downloader with support for thousands of sites. The project is a fork of youtube-dl based on the now inactive youtube-dlc.

downloader gradio huggingface instagram pkt python reddit twitter video workspace x youtube yt-dlp

Last synced: 06 May 2025

https://github.com/prithivsakthiur/prithivsakthiur

Last synced: 04 Apr 2025

https://github.com/prithivsakthiur/strangerzone-chatbot

Turning Ideas to Product - StrangerAI - StrangerZone - Zero Data Collection from User.

Last synced: 16 Aug 2025

https://github.com/prithivsakthiur/canopus-realism

Realistic Image Generation, Realistic trigger works properly, better for photorealistic trigger words, close-up shots, face diffusion, male, female characters.

computer-vision gradio lora model realism realistic sdxl stable-diffusion text2image

Last synced: 16 Aug 2025

https://github.com/prithivsakthiur/orpheus-tts-edge

Play with Orpheus TTS, a Llama-based Speech-LLM designed for high-quality, empathetic text-to-speech generation. This model has been fine-tuned to deliver human-level speech synthesis 🔥🗣️

gradio gradio-python-llm huggingface-transformers llama llm orpheus tts

Last synced: 12 Jul 2025

https://github.com/prithivsakthiur/stablediffusion

Continuous progress in AI research leads to the development of more robust algorithms, models, and techniques, making AI solutions more effective and reliable.

ai api apiurlrequest app generation genetic-algorithm gradio huggingface image llama llm python sdxl stability-ai stable-diffusion transformer

Last synced: 16 Aug 2025

https://github.com/prithivsakthiur/stable-wallpapers

Demo space for generating Desktop / Mobile Wallpapers. 16:9 / 9:16 #Dream Wallpaper by Stable Wallpaper [ stable diffusion xl ]

diffusion-models gradio huggingface huggingface-transformers lightning models stable-diffusion texttoimage transformer wallpapers

Last synced: 06 May 2025

https://github.com/prithivsakthiur/client-record-curd-ops-exercise

Client Record Management - CRUD Ops + Blazor WebAssembly with Standalone App

blazor client csharp curd-operation dotnet dotnet-core entity-framework mangement microsoftsqlserver records webapi

Last synced: 09 Jul 2025

https://github.com/prithivsakthiur/gen-vision

Multiple Conditioned Image Generation, SDXL, Low-rank adaptation Refined

colab-notebook cookbooks diffusers gen gradio huggingface imagegeneration lora spaces stable-diffusion texttoimage transformer

Last synced: 16 Aug 2025

https://github.com/prithivsakthiur/yolox-t4

Ultralytics, YOLO v8 - Computer Vision

computer-vision engine inference ultralytics video yolo yolov8

Last synced: 05 Oct 2025

https://github.com/prithivsakthiur/yolo-nms-captioning

Object Detection - Captioning ( yolo8n & blip-image-captioning-large )

Last synced: 24 Oct 2025

https://github.com/prithivsakthiur/yolox-cpu

Ultralytics, YOLO v8 - Computer Vision

counting heatmap-visualization inference object object-detection speedestimation tensorflow ultralytics video videointelligence workout-tracker yolo yolov8

Last synced: 11 Oct 2025

https://github.com/prithivsakthiur/text-to-image

Text to Image Gen [ Demo ]

a100 ai colab-notebook demo face genai hugging huggingface image image-generation image-processing ipython-notebook jupyter-notebook t4 text transformer

Last synced: 29 Jul 2025

https://github.com/prithivsakthiur/llama-3.2-1b-gguf-fast-gradio

GPT-Generated Unified Format

Last synced: 16 Jul 2025

https://github.com/prithivsakthiur/data-board

Data Boards - Visualization of various plots ( Analysis )

data-analysis gradio huggingface keras mathplotlib pandas plots pyplot scikit-learn seaborn spaces

Last synced: 28 Oct 2025

https://github.com/prithivsakthiur/wiki-scrap

Wikipedia Scraping

bs4 content gradio gradio-interface image requests scrapper scrapping sdk-python text wikipedia

Last synced: 09 Sep 2025

https://github.com/prithivsakthiur/prompt-extender-model

The "Prompt Extender" is a tool or concept that can be applied in various fields, from artificial intelligence to creative writing. Its primary function is to expand or elaborate on a given prompt or idea, providing additional context, details, or possibilities.

extend hugging huggingface huggingface-transformers machine-learning models prompt text

Last synced: 04 Apr 2025

https://github.com/prithivsakthiur/flux-api

Endpoint Image Generation using Flux

api bfl endpoints flux image

Last synced: 06 May 2025

https://github.com/prithivsakthiur/imgscope-ocr-2b-0527

The Imgscope-OCR-2B-0527 model is a fine-tuned version of Qwen2-VL-2B-Instruct, specifically optimized for messy handwriting recognition, document OCR, realistic handwritten OCR, and math problem solving with LaTeX formatting. This model is trained on custom datasets for document and handwriting OCR tasks and textual understanding.

caption gradio huggingface-transformers llms ocr ollama-gui py qwen2-vl video-processing vlm

Last synced: 08 Aug 2025

https://github.com/prithivsakthiur/agent-dino

Dino: The Minimalist Multipurpose Chat System

agent deepseek-r1 demo diffusion-models huggingface-spaces llm object-detection text-to-image vision-language-model yolo

Last synced: 29 Jul 2025

https://github.com/prithivsakthiur/type-bytes

Dynamic Typed Videos with 'Type Byte'

blog byte clipvideo color frame framework gradio gradio-interface huggingface moviepy pillow python text text-to-video textwrap video

Last synced: 19 Nov 2025

https://github.com/prithivsakthiur/grab-doc-v

MS Word Like Content Creation System

app cdn chatbot chatgpt client docx fpdf grab gradio huggingface llama llama2 llama3 minimalist openai python python-docx scrap

Last synced: 08 Dec 2025

https://github.com/prithivsakthiur/vlm-parsing

VLM-Parsing is a Gradio-based web application for parsing documents and images into structured HTML and Markdown formats using advanced Vision Language Models (VLMs).

cuda gradio html huggingface-models huggingface-spaces huggingface-transformers logics markdown ocr-recognition pytorch qwen2-5-vl spaces vlm

Last synced: 10 Oct 2025

https://github.com/prithivsakthiur/grab-doc

Chat Response Documentation

documentation-tool docx-generator fpdf gradio hugging-face huggingface-hub mistral-7b pdf-generation python-docx text-mining txt-files

Last synced: 24 Oct 2025

https://github.com/prithivsakthiur/nano-banana-aio

Nano Banana AIO is a web application built with React and the Google Gemini API for image generation and editing. It provides an all-in-one interface for creating, editing, and manipulating images using AI-powered tools.

css docker docker-image gemini-flash google-cloud-platform html image-editor javascript nano-banana typescript

Last synced: 08 Nov 2025

https://github.com/prithivsakthiur/multimodal-ocr

OCR Vision Language Model

multimodal-large-language-models ocr qwen2-vl vision-transformer vlm

Last synced: 02 Mar 2025

https://github.com/prithivsakthiur/bellatrix-tiny3-1b-webgpu

WebGPU-based LLM chatbot; try it in Chrome browsers

chatbot huggingface-transformers llama llm ollama onnx onnx-models onnxruntime webgpu

Last synced: 17 Mar 2025

https://github.com/prithivsakthiur/qwq-edge

All in Chatbot

Last synced: 06 Oct 2025

https://github.com/prithivsakthiur/deep-fake-detector-model

Vision Transformer (ViT)

deep-fake detection google image-classification transformer vision-transformer

Last synced: 02 Mar 2025

https://github.com/prithivsakthiur/flux-image-captioner

FLUX.1-dev with Qwen2VL Captioner and Prompt Enhancer

captioning-images flux google huggingface-transformers json llm-inference qwen2vl t5-small

Last synced: 10 Oct 2025

https://github.com/prithivsakthiur/grid-6x

Layout for Seamless Image Assembly

epic-realism gradio grid lora python realism sdxl stable-diffusion turbo turbo-realism

Last synced: 29 Mar 2025

https://github.com/prithivsakthiur/qwen-image-edit-2509-loras-fast

Qwen-Image-Edit-2509-LoRAs-Fast is a high-performance, user-friendly web application built with Gradio that leverages the advanced Qwen/Qwen-Image-Edit-2509 model from Hugging Face for seamless image editing tasks.

aoti diffusers diffusion-models flash-attention-3 huggingface-spaces huggingface-transformers kernel numpy peft python pytorch qwen-image-edit qwen-image-edit-2509 qwen2-5-vl qwen3-vl torch torchvision

Last synced: 18 Nov 2025

https://github.com/prithivsakthiur/ehrm-website

Last synced: 21 Aug 2025

https://github.com/prithivsakthiur/spam-or-ham

Spam Detection Using BERT

bert google single-shot-detection spam-detection text-classification uncased

Last synced: 13 Mar 2025

https://github.com/prithivsakthiur/flux-sketch-smudge-3to1

3:1 Best Image Gen

Last synced: 24 Dec 2025

https://github.com/prithivsakthiur/qwen2-vl

Qwen2VL Models: Vision and Language Processing. FT; [ LaTeX OCR, Math Parsing, Text Analogy OCRTest ]

Last synced: 24 Dec 2025

https://github.com/prithivsakthiur/triangulum

Triangulum 10B: Multilingual Large Language Models (LLMs)

10b 1b 5b llama-cpp llama-cpp-python llm ollama text-generation

Last synced: 22 Feb 2025

https://github.com/prithivsakthiur/convert-to-onnx-hf-dir

Convert a Hugging Face model to ONNX & Upload Directly to Your Hf Model Repo

conversion huggingface huggingface-transformers onnx onnxruntime optimum streamlit

Last synced: 21 Mar 2025

https://github.com/prithivsakthiur/auto-abliteration

Modify a language model's behavior by abliterating its weights.

abliteration gemma3 huggingface-transformers llm llms ollama streamlit uncensored

Last synced: 21 Mar 2025

https://github.com/prithivsakthiur/codepy-deepthink-3b

step-by-step solutions, creative content, and logical analyses

codepy coder cot deep-learning deep-think llama lmstudio ollama reasoning

Last synced: 09 Nov 2025

https://github.com/prithivsakthiur/smart-search

One-shot search ...

algorithms docker prompt-engineering search-engine

Last synced: 04 Apr 2025

https://github.com/prithivsakthiur/fire-detection-siglip2

Fire-Detection-Siglip2 is an image classification vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for a single-label classification task. It is designed to detect fire, smoke, or normal conditions using the SiglipForImageClassification architecture.

fire-detection google huggingface huggingface-transformers image-classification llama normal siglip siglip2 smoke vit

Last synced: 01 Apr 2025

https://github.com/prithivsakthiur/trash-net

Trash-Net is an image classification vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for a single-label classification task. It is designed to classify images of waste materials into different categories using the SiglipForImageClassification architecture.

classification google gradio huggingface-transformers image-classification image-processing pillow python torch trash vision-transformer

Last synced: 09 Apr 2025

https://github.com/prithivsakthiur/gwq2b

gemma with questions

chatbot gemma-2b gemma2

Last synced: 19 Jun 2025

https://github.com/prithivsakthiur/human-action-recognition

Human-Action-Recognition is an image classification vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for multi-class human action recognition. It uses the SiglipForImageClassification architecture to predict human activities from still images.

action huggingface-transformers human recognition siglip2

Last synced: 05 Sep 2025

https://github.com/prithivsakthiur/clipart-126-domainnet

Clipart-126-DomainNet is an image classification vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for a single-label classification task. It is designed to classify clipart images into 126 domain categories using the SiglipForImageClassification architecture.

art classification demo-app gradio huggingface-spaces huggingface-transformers image-classification llama siglip2 torchvision vision-transformer

Last synced: 26 Mar 2025

https://github.com/prithivsakthiur/flux-lora-downloadables

Choose the LoRA and Type the Prompt

dev dlc flux generator image lora prompt text

Last synced: 04 Apr 2025

https://github.com/prithivsakthiur/website-code-scraper-ec2-testing

Content based site code scraping. CDN Content Delivery Network x BS4

analysis-framework bs4 bs4-requests cdn coding ec2 gradio-interface langchain webscraping

Last synced: 04 Apr 2025

Last synced: 22 Jun 2025