Projects in Awesome Lists by XenosWarlocks

https://github.com/xenoswarlocks/company-url-finder

Company URL Finder is a robust Python application designed to help you efficiently search and extract company website URLs using multiple strategies. The project provides two main search approaches:

algorithm algorithms custom-search-api custom-search-engine google python3 selenium-webdriver

Last synced: 20 Dec 2024

https://github.com/xenoswarlocks/horse-spider

Horse Spider is a Python-based project designed to perform Google searches, extract links from the search results, and scrape the content from those links. The project uses Selenium for simulating human-like interactions to avoid being flagged as a bot. The scraped data can be exported in multiple formats, such as CSV, JSON, and Excel.

Last synced: 19 Feb 2025

https://github.com/xenoswarlocks/crmpanel

React-based internal tools, admin panels, dashboards, B2B apps with flexibility in mind.

graphql reactjs refine typescript

Last synced: 19 Feb 2025

https://github.com/xenoswarlocks/super-duper-engine

An automated web scraping tool that extracts, processes, and analyzes lead generation job listings from Naukri.com. This project implements a robust scraping pipeline with built-in data cleaning and processing capabilities.

data-m lead-generation

Last synced: 06 Apr 2025

https://github.com/xenoswarlocks/image_text_extractor

A Python-based tool for batch processing and extracting text from images using OCR (Tesseract). The extracted text is cleaned by removing unwanted terms, and potential names are identified and formatted. Results are saved in a structured text file for easy reference. Ideal for automating data extraction and preprocessing tasks.

ocr-python ocr-recognition pytesseract-ocr python unittest

Last synced: 19 Feb 2025

https://github.com/xenoswarlocks/file-name-conversion-tool

This Node.js script reads a text file containing names in "First Last" format, converts them into "firstname.lastname" format, adds a domain, and saves the converted names to a new text file.

Last synced: 19 Feb 2025

https://github.com/xenoswarlocks/video-processing-tool

The project will be divided into three phases for structured development and feature enhancement.

Last synced: 06 Apr 2025

https://github.com/xenoswarlocks/humanselenium

This project aims to develop a tool using Selenium WebDriver to automate data scraping tasks from websites, mimicking human interaction. Think of it as an extension of your arm in the digital world, collecting data just like you would.

Last synced: 19 Feb 2025

https://github.com/xenoswarlocks/powershell-smartstart

Enhance your pc startup routine by setting a random wallpaper, launching essential applications, checking network connectivity, and system resources and providing a daily greeting powered by gemini AI.

Last synced: 19 Feb 2025

https://github.com/xenoswarlocks/multicrawl

MultiCrawl is a powerful and flexible web crawling framework that provides multiple crawling strategies to suit different use cases and performance requirements. The library supports sequential, threaded, and asynchronous crawling methods, making it adaptable to various data extraction needs.

python threading web webcrawler

Last synced: 19 Feb 2025

https://github.com/xenoswarlocks/email-scraper

"email_scraper.py" extracts email addresses from a webpage. It uses requests and BeautifulSoup libraries, saving names and emails into separate text files. Easy-to-use with command-line interface.

Last synced: 19 Feb 2025

https://github.com/xenoswarlocks/similaroccult

This repository contains two Node.js scripts (`script.js` and `Second script.js`) designed to process text data from a file named uni.txt.

Last synced: 19 Feb 2025

https://github.com/xenoswarlocks/house-price-scraper

This project is a web scraper built using Scrapy to extract property data from Rightmove's house prices pages for the Southwark area. The spider crawls through the first 40 pages of the listings and saves the data to a CSV file.

Last synced: 19 Feb 2025

https://github.com/xenoswarlocks/trustlink-scraper

TrustLink-Scraper is a Python project that scrapes faculty profiles from the USC Dornsife website, extracts emails, and cleans the collected data. The project is divided into two scripts: `main.py` for scraping and `cleaner.py` for cleaning the data.

Last synced: 19 Feb 2025

https://github.com/xenoswarlocks/web-scraping-with-mechanize

This Ruby script utilizes the Mechanize gem to scrape a webpage. Callbacks are registered to print when navigating to and landing on a page. Still learning Ruby.

Last synced: 19 Feb 2025

https://github.com/xenoswarlocks/beautysoup

Last synced: 19 Feb 2025

https://github.com/xenoswarlocks/read-write-analysis

This project demonstrates various file writing strategies in Python, providing insights into different performance optimization techniques for file I/O operations.

concurrency python threading

Last synced: 19 Feb 2025

https://github.com/xenoswarlocks/webscrapers

Last synced: 19 Feb 2025

https://github.com/xenoswarlocks/profiletracker

An automated tool for tracking and validating LinkedIn profile changes using Python and Selenium WebDriver.

Last synced: 19 Feb 2025

https://github.com/xenoswarlocks/contactcollector

Last synced: 19 Feb 2025

https://github.com/xenoswarlocks/web-scraper-names-and-emails

Web Scraper extracts names and emails from a webpage. "scraper.py" defines the extraction logic, while "main.py" implements the scraping process. Development ongoing.

Last synced: 19 Feb 2025

https://github.com/xenoswarlocks/python-web-scraping-and-data-integration-toolkit

This Python repository provides a toolkit for web scraping, data cleaning, and integration tasks. The process involves scraping data from a specified URL, cleaning the extracted text to remove unwanted substrings, replacing specific characters, and extracting first and last names from the cleaned text.

Last synced: 19 Feb 2025

https://github.com/xenoswarlocks/gogermany

Gogermany is a web application built with Next.js, designed to help you learn German in a fun and interactive way. Inspired by Duolingo, Gogermany offers a gamified approach to language learning, at a fraction of the cost.

Last synced: 19 Feb 2025

https://github.com/xenoswarlocks/nestcart

Warmth, comfort, and a homely touch.

db go golang jwt

Last synced: 19 Feb 2025

https://github.com/xenoswarlocks/urban-wheels

Urban Wheels is a mobile application designed to replicate the core functionalities of a ride-hailing service. This project is built using React Native, Google Maps, Stripe, Serverless Postgres, and TailwindCSS, aiming to demonstrate the development of a full-stack mobile application that is both functional and visually appealing.

Last synced: 19 Feb 2025

https://github.com/xenoswarlocks/identity-dashboard

New to Vue.js, creating a small dashboard to learn Vue.

Last synced: 19 Feb 2025

https://github.com/xenoswarlocks/patternwriter

This tool automates the process of verifying email addresses extracted from contact information provided in Excel files. It uses the email_validator library to validate email addresses and generates a report of verified email addresses along with their verification status.

Last synced: 01 Mar 2025

https://github.com/xenoswarlocks/style-v.ai

A swelt-Kit webapp

Last synced: 12 Mar 2025

https://github.com/xenoswarlocks/metriforge

Generate intelligent, realistic test emails containing system performance metrics, trend analysis, and actionable insights using AI

Last synced: 12 Mar 2025

https://github.com/xenoswarlocks/smart-img-ai

Last synced: 12 Mar 2025

https://github.com/xenoswarlocks/email_pattern_verification

An webapp to verify email based on their patterns

Last synced: 23 Feb 2025

https://github.com/xenoswarlocks/newsletter-sponsor-analyzer

AI-powered Python tool for extracting and analyzing sponsor content from JavaScript Weekly newsletters using Google's Gemini model.

Last synced: 20 Mar 2025

https://github.com/xenoswarlocks/scraperapp

Last synced: 19 Feb 2025

https://github.com/xenoswarlocks/adashb

Golang admin panel (in progress)

Last synced: 19 Feb 2025

https://github.com/xenoswarlocks/trustlinked

Last synced: 19 Feb 2025

https://github.com/xenoswarlocks/horse-scraping

Horse Scraper is a Python-based project designed to perform Google searches, extract links from the search results, and scrape the content from those links. The project uses Selenium for simulating human-like interactions to avoid being flagged as a bot. The scraped data can be exported in multiple formats, such as CSV, JSON, and Excel.

Last synced: 19 Feb 2025