https://github.com/403errors/rental-price-predictions
Data-driven model that predicts the rental price of residential properties based on relevant features. By analyzing historical rental data and property attributes, the model aims to provide accurate and reliable rent predictions.
https://github.com/403errors/rental-price-predictions
Last synced: about 1 year ago
JSON representation
Data-driven model that predicts the rental price of residential properties based on relevant features. By analyzing historical rental data and property attributes, the model aims to provide accurate and reliable rent predictions.
- Host: GitHub
- URL: https://github.com/403errors/rental-price-predictions
- Owner: 403errors
- Created: 2023-08-13T09:57:41.000Z (almost 3 years ago)
- Default Branch: main
- Last Pushed: 2023-08-14T00:54:34.000Z (almost 3 years ago)
- Last Synced: 2025-01-30T02:15:17.904Z (over 1 year ago)
- Language: Jupyter Notebook
- Homepage:
- Size: 4.63 MB
- Stars: 1
- Watchers: 1
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Rental Price Predictions Project By GuviOfficial

##
Problem Statement
The goal of this project is to develop a data-driven model that predicts the rental price of
residential properties based on relevant features. By analyzing historical rental data and
property attributes, the model aims to provide accurate and reliable rent predictions.
## Dataset Description
There are 25 attributes in each case of the dataset which are:
1. **id**: A unique identifier for each property listing.
2. **type** : The type of property, such as BHK1, BHK2, RK1, etc.
3. **locality**: The specific neighbourhood or area where the property is located.
4. **activation_date**: When the property listing was activated or made available for
rent.
5. **latitude**: The geographic latitude coordinate of the property's location.
6. **longitude**: The geographic longitude coordinate of the property's location.
7. **lease_type**: The type of lease, such as FAMILY or BACHELOR, or ANYONE
8. **gym**: Indicates whether the property has a gym or fitness facility.
9. **lift**: Indicates whether the property has an elevator or lift.
10. **swimming_pool**: Indicates whether the property has a swimming pool.
11. **negotiable**: Indicates whether the rent price is negotiable.
12. **furnishing**: Describes the level of furnishing, e.g., fully furnished, partially furnished,
unfurnished.
13. **parking**: Specifies the availability of parking facilities.
14. **property_size**: The size of the property in terms of square footage or square meters.
15. **property_age**: The age of the property since construction.
16. **bathroom**: The number of bathrooms available on the property.
17. **facing**: The direction in which the property faces, e.g., north, south, east, west.
18. **cup_board**: Indicates the presence of cupboards or storage units.
19. **floor**: The floor number on which the property is located.
20. **total_floor**: The total number of floors in the building.
21. **amenities**: Additional amenities or features provided with the property.
22. **water_supply**: The type and availability of water supply.
23. **building_type**: The architectural style or type of building, e.g., Apartment, Individual
House
24. **balconies**: The number of balconies or outdoor spaces.
25. **rent**: The target variable, represents the rental price for the property.
## Aim
- Develop a data-centric framework with the goal of predicting residential property rental costs based on relevant features.
- Anticipate rental prices by analyzing significant property attributes and characteristics.
- Utilize historical rental data and property details to enhance the accuracy of predictions.
- Strive to provide reliable and precise rent forecasts using the developed model.
## Technologies
Technologies that were used while creating this project were -
- **Cleansing** : Data cleansing was indispensable to eliminate inconsistencies, errors, and outliers from the rental dataset.
- **EDA** : Through visualizations, statistical summaries, and pattern recognition, EDA enabled the identification of trends, correlations, and potential hidden insights among the rental-related attributes.
- **Visualization** : Visual representations of data distributions, trends, and geographical patterns aided in presenting a clear and coherent narrative regarding the factors influencing rental prices. This not only fostered better understanding among project participants but also guided informed decisions on feature engineering and model design.
- **ML** : By using ML techniques, the model was able to learn from historical rental data and attribute relationships discovered during the EDA phase. This enabled the model to generalize patterns, predict rental prices for new instances, and adapt to changing rental market dynamics over time.
The culmination of these technologies ensured that the predictive model achieved the desired accuracy, robustness, and applicability in forecasting rental prices for residential properties.
## Requirements
- Python3 and pip
## Libraries
- **Matplotlib** : It is the basic plotting library in Python. It provides tools for making plots.
- **Pandas** : It provides tools for data storage, manipulation and analysis tasks.
- **Seaborn** : Used in making statistical graphics
- **json** : It allows you to encode and decode custom objects by using JSON encoder and decoder classes.
- **numpy** : It provides a fast numerical array structure and operating functions.
- **sklearn** : It provides a selection of efficient tools for machine learning and statistical modelling.
## Data Cleaning
### Type: 1 - Unnecessary Data
- id
- locality: Data within this column spans over a wide range.
### Type: 2 - Format Conversion
- activation_date
### Type: 3 - Boolean Data
- gym
- swimming_pool
- lift
- negotiable
### Type: 4 - Categorical Data
- type
- lease_type
- furnishing
- parking
- facing
- water_supply
- building_type
### Type: 5 - Numerical Data - No preprocessing required
- latitude
- longitude
- property_size
- property_age
- bathroom
- cup_board
- floor
- total_floor
- balconies
### Type: 6 - String Object Data
- amenities
## References
The concepts in this project have been referred from the following websites -
- https://en.wikipedia.org/wiki/Linear_regression
- https://en.wikipedia.org/wiki/Root-mean-square_deviation