
Baixue (Doris) Zhang
Statistics Grad Student | TA | Aspiring Data Analyst
Projects
Friends-TV-Series Statistical Analysis

R programming | Regression Analysis
This project explores three questions in the Friends TV shows:
1.Line distribution among the six main characters vs. non-main characters.
2. Analysis whether the distributions of Joey’s "How you doin’?" and six main characters’ "I love you" align with the story plot.
3. Explore the relationship between IMdb ratings and viewership through regression modeling.
Refining the Regression Model:
Variable Selection for Automobile Dataset

R Programming | Python | Regression Analysis
In this project, my group analyzed an automobile dataset to develop models for predicting car prices. My primary responsibility was variable selection. Using analytical techniques and R's built-in tools, I identified 8 key predictors from 25 variables.
The main challenges were:
Dealing with multicollinearity among predictor variables.
Managing a significant categorical variable with severe imbalanced group sizes.
Contact
Thank you for viewing my portfolio!
I’m excited to connect and discuss potential opportunities, collaborations, or any questions you may have. Please feel free to reach out via any of the contacts below.