Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/arunsathiya/real-estate-web-scraping-and-analysis
- Host: GitHub
- URL: https://github.com/arunsathiya/real-estate-web-scraping-and-analysis
- Owner: arunsathiya
- Created: 2024-01-10T07:32:30.000Z (almost 1 year ago)
- Default Branch: main
- Last Pushed: 2024-01-10T07:34:08.000Z (almost 1 year ago)
- Last Synced: 2024-11-01T11:07:27.640Z (2 months ago)
- Language: Jupyter Notebook
- Size: 1.84 MB
- Stars: 0
- Watchers: 0
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
## Data analysis project using Python programming
In this project, Python is used to scrape data from a housing research website and to analyze the data collected.
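The gathering step can be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code: the README does not name the website or its markup, so `SAMPLE_HTML`, the `listing`/`city`/`price`/`beds` class names, and the `parse_listings` helper are all hypothetical. It only shows how bs4 and re could work together to turn listing markup into clean numeric fields.

```python
import re

from bs4 import BeautifulSoup

# Hypothetical listing markup; the real site's structure is not given in this README.
SAMPLE_HTML = """
<div class="listing">
  <span class="city">Springfield</span>
  <span class="price">$350,000</span>
  <span class="beds">3 bd</span>
</div>
<div class="listing">
  <span class="city">Riverton</span>
  <span class="price">$425,500</span>
  <span class="beds">4 bd</span>
</div>
"""


def parse_listings(html):
    """Extract city, price, and bedroom count from each (assumed) listing card."""
    soup = BeautifulSoup(html, "html.parser")
    rows = []
    for card in soup.find_all("div", class_="listing"):
        price_text = card.find("span", class_="price").get_text()
        beds_text = card.find("span", class_="beds").get_text()
        rows.append({
            "city": card.find("span", class_="city").get_text(strip=True),
            # Strip "$" and "," so the price becomes an integer.
            "price": int(re.sub(r"[^\d]", "", price_text)),
            # Take the first run of digits as the bedroom count.
            "bedrooms": int(re.search(r"\d+", beds_text).group()),
        })
    return rows


listings = parse_listings(SAMPLE_HTML)
```

In a real run, the HTML would presumably come from something like `requests.get(url).text`, and the resulting list of dictionaries could be passed to `pandas.DataFrame(listings)` for analysis.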
## Libraries used
- bs4 is the BeautifulSoup library used for parsing HTML and XML documents.
- pandas is a library used for data manipulation and analysis.
- numpy is a library used for mathematical operations on arrays and matrices.
- requests is a library used for making HTTP requests to websites.
- re is Python's built-in regular expression module, used for pattern matching and string manipulation.
- Seaborn is a library for data visualization built on top of matplotlib.
- Matplotlib is a plotting library for creating static, animated, and interactive visualizations in Python.
## Sample of analyses performed
- Exploratory data analysis
- Price distribution across houses
- Average Increase in House Price for each city
- Factors that affect house prices
- Average housing prices by cities
- Effects of bedroom count on house prices
- Average bathroom count by price
- County-wise average price per square foot
## Models used
- OLS - Multiple Linear Regression
- Predictive analysis
- Residual plots
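As a rough sketch of the modelling step, the snippet below fits a multiple linear regression by ordinary least squares and computes the residuals that a residual plot would display. The README does not say which library the notebook uses for OLS, so this uses `numpy.linalg.lstsq` as a stand-in. The `fit_ols` helper, the feature choice (square footage and bedroom count), and the sample figures are all hypothetical.

```python
import numpy as np


def fit_ols(features, target):
    """Fit target ~ intercept + features by ordinary least squares.

    Returns (coefficients, fitted values, residuals); coefficients[0] is the intercept.
    """
    features = np.asarray(features, dtype=float)
    target = np.asarray(target, dtype=float)
    # Prepend a column of ones so the model includes an intercept term.
    X = np.column_stack([np.ones(len(features)), features])
    coef, *_ = np.linalg.lstsq(X, target, rcond=None)
    fitted = X @ coef
    residuals = target - fitted
    return coef, fitted, residuals


# Hypothetical data: [square feet, bedrooms] per house, price in thousands of dollars.
houses = np.array([
    [1000, 2],
    [1500, 3],
    [2000, 3],
    [2500, 4],
    [1800, 2],
    [1200, 2],
])
price_k = np.array([152.0, 210.0, 258.0, 330.0, 225.0, 170.0])

coef, fitted, residuals = fit_ols(houses, price_k)
```

A residual plot could then be drawn with, for example, `matplotlib.pyplot.scatter(fitted, residuals)`. A patternless cloud around zero suggests the linear form is reasonable, while curvature or a funnel shape hints at a missing term or non-constant variance.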