Bhishan Poudel, Ph.D. Candidate

Data Scientist

LinkedIn GitHub Twitter Stack Overflow

Section A: PySpark Introduction

Renaming columns in PySpark

Creating new column using multiple conditions in PySpark



Section B: Kaggle Projects

Various Regressors: Consumer Complaints

Random Forest Regressor: Allstate Insurance

Various Regressors: Fraud Detection

Various Regressors: House Price Prediction

Random Forest Tuning: House Price Prediction



Section C: Regression

Linear Regression with PySpark

Logistic Regression with PySpark

Logistic Regression Consulting Project with PySpark



Section D: Tree Methods

Random Forest Classification with PySpark



Section E: Recommender Systems

Recommender Systems with PySpark



Section F: Clustering

K-Means Clustering with PySpark

K-Means Clustering Project with PySpark



Section G: Streaming

Streaming with PySpark

Streaming Twitter Data with PySpark


