{"id":46480362,"url":"https://github.com/nilukush/resume-parser","last_synced_at":"2026-03-06T08:12:21.111Z","repository":{"id":339913774,"uuid":"1163830667","full_name":"nilukush/resume-parser","owner":"nilukush","description":"AI-Powered Resume Parser with OCR, NLP, and AI Enhancement","archived":false,"fork":false,"pushed_at":"2026-02-22T13:57:46.000Z","size":343,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":0,"default_branch":"main","last_synced_at":"2026-02-22T14:06:22.927Z","etag":null,"topics":["ai","gpt-4","nlp-parsing","openai","resume","resume-parser"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/nilukush.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null,"notice":null,"maintainers":null,"copyright":null,"agents":null,"dco":null,"cla":null}},"created_at":"2026-02-22T08:02:40.000Z","updated_at":"2026-02-22T08:07:49.000Z","dependencies_parsed_at":null,"dependency_job_id":null,"html_url":"https://github.com/nilukush/resume-parser","commit_stats":null,"previous_names":["nilukush/resume-parser"],"tags_count":null,"template":false,"template_full_name":null,"purl":"pkg:github/nilukush/resume-parser","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/nilukush%2Fresume-parser","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/nilukush%2Fresume-parser/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/nilukush%2Fresume-parser/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/nilukush%2Fresume-parser/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/nilukush","download_url":"https://codeload.github.com/nilukush/resume-parser/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/nilukush%2Fresume-parser/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":30167089,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-03-06T07:56:45.623Z","status":"ssl_error","status_checked_at":"2026-03-06T07:55:55.621Z","response_time":250,"last_error":"SSL_connect returned=1 errno=0 peeraddr=140.82.121.6:443 state=error: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["ai","gpt-4","nlp-parsing","openai","resume","resume-parser"],"created_at":"2026-03-06T08:12:20.460Z","updated_at":"2026-03-06T08:12:21.093Z","avatar_url":"https://github.com/nilukush.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# ResuMate - Smart Resume Parser\n\nAn intelligent resume parsing platform that extracts structured data from resumes using OCR, NLP, and AI.\n\n## Tech Stack\n\n- **Backend:** FastAPI (Python 3.11+) with WebSocket support\n- **Frontend:** React 18+ with TypeScript and WebSocket hooks\n- **Database:** PostgreSQL with JSONB\n- **AI/ML:** Tesseract OCR, spaCy NLP, OpenAI GPT-4\n- **Deployment:** Railway (backend), Vercel (frontend)\n- **Testing:** Pytest (backend), Vitest (frontend)\n\n## Features\n\n- Multi-format support (PDF, DOCX, DOC, TXT)\n- Real-time parsing progress via WebSocket\n- NLP-based entity extraction\n- Confidence scoring\n- Review and edit parsed data\n- **Shareable links** with configurable expiration\n- **Export to PDF** with professional formatting\n- **Social sharing** (WhatsApp, Telegram, Email)\n- Access tracking and share revocation\n\n## Project Structure\n\n```\nresume-parser/\n|-- backend/          # FastAPI application\n|   |-- app/\n|   |   |-- api/      # API routes \u0026 WebSocket handlers\n|   |   |-- models/   # SQLAlchemy models \u0026 progress types\n|   |   |-- services/ # Business logic (parser, orchestrator, export)\n|   |   |-- core/     # Storage, config, database\n|   |   `-- main.py   # FastAPI app entry\n|   |-- tests/\n|   |   |-- unit/     # Unit tests\n|   |   |-- integration/  # Integration tests\n|   |   `-- e2e/      # End-to-end tests\n|   `-- requirements.txt\n|-- frontend/         # React application\n|   |-- src/\n|   |   |-- components/ # React components\n|   |   |-- pages/      # Page components\n|   |   |-- hooks/      # Custom React hooks (WebSocket)\n|   |   |-- services/   # API client\n|   |   `-- lib/        # Utilities\n|   `-- package.json\n`-- docs/\n    `-- plans/\n```\n\n## Getting Started\n\n### Backend Setup\n\n```bash\ncd backend\n\n# Create virtual environment\npython3 -m venv .venv\nsource .venv/bin/activate  # On Windows: .venv\\Scripts\\activate\n\n# Install dependencies\npip install -r requirements.txt\n\n# Download spaCy model\npython -m spacy download en_core_web_sm\n\n# Setup environment\ncp .env.example .env\n# Edit .env with your configuration\n\n# Run development server\nuvicorn app.main:app --reload --host 0.0.0.0 --port 8000\n```\n\nBackend will be available at http://localhost:8000\n\n### Frontend Setup\n\n```bash\ncd frontend\n\n# Install dependencies\nnpm install\n\n# Setup environment\ncp .env.example .env\n# Edit .env if needed (defaults to http://localhost:8000)\n\n# Run development server\nnpm run dev\n```\n\nFrontend will be available at http://localhost:3000\n\n### Testing\n\n**Backend:**\n```bash\ncd backend\nsource .venv/bin/activate\npython -m pytest tests/ -v\n```\n\n**Frontend:**\n```bash\ncd frontend\nnpm test -- --run\nnpm run type-check\n```\n\n## Usage Flow\n\n1. **Upload** - Upload resume (PDF, DOCX, DOC, TXT) at http://localhost:3000\n2. **Processing** - Watch real-time parsing progress via WebSocket\n3. **Review** - Review extracted data with confidence scores, make corrections\n4. **Share** - Create shareable links, export to PDF, or share via social media\n\n## WebSocket Communication\n\n**Connection:**\n```\nws://localhost:8000/ws/resumes/{resume_id}\n```\n\n**Progress Update Message:**\n```json\n{\n  \"type\": \"progress_update\",\n  \"stage\": \"text_extraction | nlp_parsing | ai_enhancement | complete\",\n  \"progress\": 50,\n  \"status\": \"Extracting text...\",\n  \"estimated_seconds_remaining\": 15\n}\n```\n\n## Share and Export API\n\n### Create Shareable Link\n\n```http\nPOST /v1/resumes/{resume_id}/share\n```\n\n**Response:**\n```json\n{\n  \"share_token\": \"uuid-v4-token\",\n  \"share_url\": \"http://localhost:3000/share/uuid-v4-token\",\n  \"expires_at\": \"2026-03-20T12:00:00\"\n}\n```\n\n### Export Resume\n\n**PDF:**\n```http\nGET /v1/resumes/{resume_id}/export/pdf\n```\nReturns PDF file with `Content-Type: application/pdf`\n\n**WhatsApp:**\n```http\nGET /v1/resumes/{resume_id}/export/whatsapp\n```\n```json\n{\n  \"whatsapp_url\": \"https://wa.me/?text=...\"\n}\n```\n\n**Telegram:**\n```http\nGET /v1/resumes/{resume_id}/export/telegram\n```\n```json\n{\n  \"telegram_url\": \"https://t.me/share/url?url=\u0026text=...\"\n}\n```\n\n**Email:**\n```http\nGET /v1/resumes/{resume_id}/export/email\n```\n```json\n{\n  \"mailto_url\": \"mailto:?subject=...\u0026body=...\"\n}\n```\n\n### Public Share Access\n\n```http\nGET /v1/share/{share_token}\n```\n\nReturns resume data without authentication. Status codes:\n- `200` - Success\n- `403` - Share has been revoked\n- `404` - Share not found\n- `410` - Share has expired\n\n### Revoke Share\n\n```http\nDELETE /v1/resumes/{resume_id}/share\n```\n\n**Response:**\n```json\n{\n  \"message\": \"Share revoked successfully\",\n  \"resume_id\": \"resume-id\"\n}\n```\n\n## Environment Variables\n\n**Backend (.env):**\n- `DATABASE_URL` - PostgreSQL connection string\n- `REDIS_URL` - Redis connection for Celery\n- `OPENAI_API_KEY` - OpenAI API key (optional)\n- `SECRET_KEY` - Secret key for signing\n- `ALLOWED_ORIGINS` - CORS allowed origins\n\n**Frontend (.env):**\n- `VITE_API_BASE_URL` - Backend API base URL\n- `VITE_WS_BASE_URL` - WebSocket base URL\n\n## Test Results\n\n### Backend Tests: 120/120 Passing\n- Unit tests: 67 tests\n- Integration tests: 50 tests\n- E2E tests: 4 tests\n\n### Frontend Tests: 31/31 Passing\n- Component tests: 31 tests\n- Type check: Passed (TypeScript strict mode)\n\n## License\n\nMIT\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fnilukush%2Fresume-parser","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fnilukush%2Fresume-parser","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fnilukush%2Fresume-parser/lists"}