{"id":22479672,"url":"https://github.com/djdurga/goibibo_web_scrape_sql_analysis","last_synced_at":"2026-03-19T23:01:31.294Z","repository":{"id":263024365,"uuid":"889111048","full_name":"Djdurga/Goibibo_Web_Scrape_SQL_Analysis","owner":"Djdurga","description":"This project involves web scraping flight data from Goibibo.com and performing SQL analysis on the collected dataset.","archived":false,"fork":false,"pushed_at":"2024-11-15T16:31:21.000Z","size":11,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-10-20T13:02:00.577Z","etag":null,"topics":["sql"],"latest_commit_sha":null,"homepage":"","language":null,"has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/Djdurga.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-11-15T16:22:37.000Z","updated_at":"2024-11-15T16:32:37.000Z","dependencies_parsed_at":"2024-11-19T03:02:42.767Z","dependency_job_id":null,"html_url":"https://github.com/Djdurga/Goibibo_Web_Scrape_SQL_Analysis","commit_stats":null,"previous_names":["djdurga/goibibo_web_scrape_sql_analysis"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/Djdurga/Goibibo_Web_Scrape_SQL_Analysis","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Djdurga%2FGoibibo_Web_Scrape_SQL_Analysis","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Djdurga%2FGoibibo_Web_Scrape_SQL_Analysis/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Djdurga%2FGoibibo_Web_Scrape_SQL_Analysis/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Djdurga%2FGoibibo_Web_Scrape_SQL_Analysis/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/Djdurga","download_url":"https://codeload.github.com/Djdurga/Goibibo_Web_Scrape_SQL_Analysis/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Djdurga%2FGoibibo_Web_Scrape_SQL_Analysis/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":29589584,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-02-18T16:55:40.614Z","status":"ssl_error","status_checked_at":"2026-02-18T16:55:37.558Z","response_time":162,"last_error":"SSL_connect returned=1 errno=0 peeraddr=140.82.121.6:443 state=error: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["sql"],"created_at":"2024-12-06T15:16:26.858Z","updated_at":"2026-02-18T18:33:01.141Z","avatar_url":"https://github.com/Djdurga.png","language":null,"funding_links":[],"categories":[],"sub_categories":[],"readme":"# Goibibo Flight Data Scraping and SQL Analysis\n\n## Introduction\nThis project involves web scraping flight data from Goibibo.com and performing SQL analysis on the collected dataset. The objective is to extract relevant flight information, clean and preprocess the data, and utilize SQL queries to derive insights into flights between Chennai and New Delhi.\n\n## Project Overview\nThis project includes several key steps that contribute to a comprehensive analysis of flight data:\n\n### 1. Web Scraping\nI utilized the Listly web scraping tool to extract detailed flight data from Goibibo for flights between Chennai International Airport and Indira Gandhi International Airport on September 29, 2023. The focus was on economy class flights for that specific date and route.\n\n### 2. Data Cleaning and Preprocessing\nAfter collecting the data, I cleaned and preprocessed the dataset using Microsoft Excel. Key tasks include:\n\n- **Handling Missing Values**: Addressing missing values through imputation or exclusion, based on context.\n- **Standardizing Time Formats**: Converting time-related columns (e.g., Departure Time, Arrival Time, Layover Duration) to a consistent format.\n- **Formatting Duration**: Splitting the `Total_Duration` column into `Total_Duration_Hours` and `Total_Duration_Minutes` for more granular analysis.\n\n### 3. SQL Analysis\nThe core of this project involves SQL analysis, where I formulated and executed several queries to extract meaningful insights from the flight data. Key analysis points include:\n\n- **Average Flight Price**: Determining the average price of flights from Chennai to New Delhi.\n- **Cheapest and Most Expensive Flights**: Identifying airlines offering the cheapest and most expensive flights for the route.\n- **Flight Duration Distribution**: Visualizing the distribution of flight durations for this route.\n- **Flights with Layovers**: Analyzing layover cities and their prevalence, along with layover durations for flights on this route.\n- **Correlation Analysis**: Investigating potential correlations between flight duration and price.\n- **Effect of Free Meals and Layovers on Price**: Exploring how the availability of free meals and layovers affects flight prices.\n- **Airline Frequency**: Identifying airlines with the highest number of flights on the Chennai-New Delhi route.\n\n## Conclusion\nThis project demonstrates my proficiency in web scraping, data cleaning, and SQL analysis. By applying these skills to real-world flight data, I extracted actionable insights that can benefit travelers and airlines alike.\n\n## Repository Purpose\nThis GitHub repository showcases the Goibibo Flight Data Scraping and SQL Analysis project. It serves as an example of my data manipulation, cleaning, and analysis skills, providing potential employers and collaborators with a tangible demonstration of my capabilities.\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdjdurga%2Fgoibibo_web_scrape_sql_analysis","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fdjdurga%2Fgoibibo_web_scrape_sql_analysis","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdjdurga%2Fgoibibo_web_scrape_sql_analysis/lists"}