Projects in Awesome Lists by XenosWarlocks
A curated list of projects in awesome lists by XenosWarlocks .
https://github.com/xenoswarlocks/company-url-finder
Company URL Finder is a robust Python application designed to help you efficiently search and extract company website URLs using multiple strategies. The project provides two main search approaches:
algorithm algorithms custom-search-api custom-search-engine google python3 selenium-webdriver
Last synced: 20 Dec 2024
https://github.com/xenoswarlocks/horse-spider
Horse Spider is a Python-based project designed to perform Google searches, extract links from the search results, and scrape the content from those links. The project uses Selenium for simulating human-like interactions to avoid being flagged as a bot. The scraped data can be exported in multiple formats, such as CSV, JSON, and Excel.
Last synced: 19 Feb 2025
https://github.com/xenoswarlocks/crmpanel
React-based internal tools, admin panels, dashboards, B2B apps with flexibility in mind.
graphql reactjs refine typescript
Last synced: 19 Feb 2025
https://github.com/xenoswarlocks/super-duper-engine
An automated web scraping tool that extracts, processes, and analyzes lead generation job listings from Naukri.com. This project implements a robust scraping pipeline with built-in data cleaning and processing capabilities.
Last synced: 06 Apr 2025
https://github.com/xenoswarlocks/image_text_extractor
A Python-based tool for batch processing and extracting text from images using OCR (Tesseract). The extracted text is cleaned by removing unwanted terms, and potential names are identified and formatted. Results are saved in a structured text file for easy reference. Ideal for automating data extraction and preprocessing tasks.
ocr-python ocr-recognition pytesseract-ocr python unittest
Last synced: 19 Feb 2025
https://github.com/xenoswarlocks/file-name-conversion-tool
This Node.js script reads a text file containing names in "First Last" format, converts them into "firstname.lastname" format, adds a domain, and saves the converted names to a new text file.
Last synced: 19 Feb 2025
https://github.com/xenoswarlocks/video-processing-tool
The project will be divided into three phases for structured development and feature enhancement.
Last synced: 06 Apr 2025
https://github.com/xenoswarlocks/humanselenium
This project aims to develop a tool using Selenium WebDriver to automate data scraping tasks from websites, mimicking human interaction. Think of it as an extension of your arm in the digital world, collecting data just like you would.
Last synced: 19 Feb 2025
https://github.com/xenoswarlocks/powershell-smartstart
Enhance your pc startup routine by setting a random wallpaper, launching essential applications, checking network connectivity, and system resources and providing a daily greeting powered by gemini AI.
Last synced: 19 Feb 2025
https://github.com/xenoswarlocks/multicrawl
MultiCrawl is a powerful and flexible web crawling framework that provides multiple crawling strategies to suit different use cases and performance requirements. The library supports sequential, threaded, and asynchronous crawling methods, making it adaptable to various data extraction needs.
python threading web webcrawler
Last synced: 19 Feb 2025
https://github.com/xenoswarlocks/email-scraper
"email_scraper.py" extracts email addresses from a webpage. It uses requests and BeautifulSoup libraries, saving names and emails into separate text files. Easy-to-use with command-line interface.
Last synced: 19 Feb 2025
https://github.com/xenoswarlocks/similaroccult
This repository contains two Node.js scripts (`script.js` and `Second script.js`) designed to process text data from a file named uni.txt.
Last synced: 19 Feb 2025
https://github.com/xenoswarlocks/house-price-scraper
This project is a web scraper built using Scrapy to extract property data from Rightmove's house prices pages for the Southwark area. The spider crawls through the first 40 pages of the listings and saves the data to a CSV file.
Last synced: 19 Feb 2025
https://github.com/xenoswarlocks/trustlink-scraper
TrustLink-Scraper is a Python project that scrapes faculty profiles from the USC Dornsife website, extracts emails, and cleans the collected data. The project is divided into two scripts: `main.py` for scraping and `cleaner.py` for cleaning the data.
Last synced: 19 Feb 2025
https://github.com/xenoswarlocks/web-scraping-with-mechanize
This Ruby script utilizes the Mechanize gem to scrape a webpage. Callbacks are registered to print when navigating to and landing on a page. Still learning Ruby.
Last synced: 19 Feb 2025
https://github.com/xenoswarlocks/read-write-analysis
This project demonstrates various file writing strategies in Python, providing insights into different performance optimization techniques for file I/O operations.
Last synced: 19 Feb 2025
https://github.com/xenoswarlocks/profiletracker
An automated tool for tracking and validating LinkedIn profile changes using Python and Selenium WebDriver.
Last synced: 19 Feb 2025
https://github.com/xenoswarlocks/web-scraper-names-and-emails
Web Scraper extracts names and emails from a webpage. "scraper.py" defines the extraction logic, while "main.py" implements the scraping process. Development ongoing.
Last synced: 19 Feb 2025
https://github.com/xenoswarlocks/python-web-scraping-and-data-integration-toolkit
This Python repository provides a toolkit for web scraping, data cleaning, and integration tasks. The process involves scraping data from a specified URL, cleaning the extracted text to remove unwanted substrings, replacing specific characters, and extracting first and last names from the cleaned text.
Last synced: 19 Feb 2025
https://github.com/xenoswarlocks/gogermany
Gogermany is a web application built with Next.js, designed to help you learn German in a fun and interactive way. Inspired by Duolingo, Gogermany offers a gamified approach to language learning, at a fraction of the cost.
Last synced: 19 Feb 2025
https://github.com/xenoswarlocks/nestcart
Warmth, comfort, and a homely touch.
Last synced: 19 Feb 2025
https://github.com/xenoswarlocks/urban-wheels
Urban Wheels is a mobile application designed to replicate the core functionalities of a ride-hailing service. This project is built using React Native, Google Maps, Stripe, Serverless Postgres, and TailwindCSS, aiming to demonstrate the development of a full-stack mobile application that is both functional and visually appealing.
Last synced: 19 Feb 2025
https://github.com/xenoswarlocks/identity-dashboard
New to Vue.js, creating a small dashboard to learn Vue.
Last synced: 19 Feb 2025
https://github.com/xenoswarlocks/patternwriter
This tool automates the process of verifying email addresses extracted from contact information provided in Excel files. It uses the email_validator library to validate email addresses and generates a report of verified email addresses along with their verification status.
Last synced: 01 Mar 2025
https://github.com/xenoswarlocks/metriforge
Generate intelligent, realistic test emails containing system performance metrics, trend analysis, and actionable insights using AI
Last synced: 12 Mar 2025
https://github.com/xenoswarlocks/email_pattern_verification
An webapp to verify email based on their patterns
Last synced: 23 Feb 2025
https://github.com/xenoswarlocks/newsletter-sponsor-analyzer
AI-powered Python tool for extracting and analyzing sponsor content from JavaScript Weekly newsletters using Google's Gemini model.
Last synced: 20 Mar 2025
https://github.com/xenoswarlocks/horse-scraping
Horse Scraper is a Python-based project designed to perform Google searches, extract links from the search results, and scrape the content from those links. The project uses Selenium for simulating human-like interactions to avoid being flagged as a bot. The scraped data can be exported in multiple formats, such as CSV, JSON, and Excel.
Last synced: 19 Feb 2025