Data Science Resources

Predicting the Price of a House

Developed a regression model and used machine learning to analyze how crime rate and number of rooms influence home prices. Utilized Python, Anaconda, Pandas, Numpy, Sklearn, and Seaborn for data analysis and model building. Found that the KNN model performed better than Linear Regression in predicting home prices based on these variables.

IPL Win Percentage Calculator

Developed a logistic regression and utilized machine learning model to predict the probability of winning an IPL match using features such as runs needed, balls remaining, wickets fallen, and venue. Preprocessed ball-by-ball IPL data and evaluated model performance across various game segments to ensure accuracy.

The Impact of War on Ukraine’s GDP Relative to Other Countries

Developed a comprehensive analysis and forecasting tool using GDP data to identify and compare Ukraine’s economic trajectory with the most similar country based on GDP metrics from 2018 to 2021. Utilized the Euclidean distance to pinpoint the country with the closest GDP profile, then applied ARIMA time series modeling to forecast GDP for both Ukraine and the identified country for 2022-2024. The results were visualized through detailed plots showcasing both historical and projected GDP data, facilitating insights into economic trends and potential future scenarios.

datascience logo

Your One-Stop Shop for Data Science Resources by Neel Jain, a high school data science student.