Osman Omer Mustafa

Medical Student & Aspiring Researcher

Assiut University | Faculty of Medicine

About Me

Fourth-year Medical Student at Assiut University. Building a foundation in Clinical Research with a focus on Medical Data Analysis. Actively learning R programming to contribute to evidence-based medicine.

Future Goals

My goal is to gradually build skills in research methodology and data analysis, and to contribute to systematic reviews, meta-analyses, and other research projects in the future.

Learning Log

Exploratory Data Analysis (EDA)
JAN 4, 2026

Clinical Data Exploration

Initial exploratory project on medical records utilizing public datasets.

R Tidyverse
View on GitHub →
JAN 6, 2026

Heart Disease Descriptive Study

Descriptive analysis focusing on visual comparisons through histograms.

R Histograms Tidyverse
View on GitHub →
JAN 9, 2026

Heart Failure Clinical Analysis

Exploring clinical features and correlations within heart failure patient data.

R Box plots Tidyverse Histogram
View on GitHub →
JAN 11, 2026

Air Quality Analysis (JHU Case Study)

Replicating a Johns Hopkins University case study to analyze air pollutants.

R JHU-Curriculum Case Study
View on GitHub →
1/4

Statistical Inference
JAN 17, 2026

Heart Disease: t-test Analysis

Comparative analysis of cholesterol levels using Welch t-test and normality assessment via QQ-plots.

Welch t-test p < 0.05 QQ-plots Inferential
View on GitHub →
JAN 25, 2026

Heart Disease: Chi-Square Analysis

Association analysis between chest pain types and heart disease using Chi-Square test and visual representation.

Chi-Square Test Cramer's V Contingency Table p < 0.05
View on GitHub →
JAN 27, 2026

Heart Disease: Correlation Analysis

Investigating the relationship between age and cholesterol levels using Spearman correlation and LOESS smoothing.

Correlation Spearman Scatter Plot p < 0.001
View on GitHub →
JAN 31, 2026

Heart Disease: ANOVA & Group Comparison

Group comparison of cholesterol levels across four chest pain types using Kruskal-Wallis and post-hoc analysis.

Kruskal-Wallis Post-hoc Group Comparison p = 0.03
View on GitHub →
1/4

Linear Regression Series
FEB 2, 2026

Insurance Charges: Simple Linear Regression

Analyzing the impact of age on insurance costs using a simple linear regression model with diagnostic checks.

Linear Regression Diagnostic Plots Scatter Plot
View on GitHub →
FEB 6, 2026

Insurance Charges: Multiple Linear Regression

Predicting insurance charges using age, BMI, and smoking status with multiple linear regression and diagnostic validation.

Multiple Regression Diagnostics VIF
View on GitHub →
FEB 7, 2026

Insurance Charges: Categorical Predictors

Learning to incorporate categorical variables like sex and region into regression models using dummy variables and interpreting their coefficients.

Dummy Variables Categorical Data Coefficient Plot
View on GitHub →
FEB 11, 2026

Insurance Charges: Interaction Effects

Modeling and interpreting the interaction between age and smoking status to see if the effect of age on charges differs for smokers and non-smokers.

Interaction Term Effect Modification ANOVA Slope Change
View on GitHub →
FEB 15, 2026

Insurance Charges: Polynomial Regression

Modeling non-linear relationships between age, BMI, and insurance charges using quadratic and cubic polynomial terms with model comparison.

Polynomial Terms Non-Linear Overfitting Model Comparison
View on GitHub →
1/5

Advanced Regression & Classification

Systematic Review & Meta-analysis

Coming Soon

Training & Certificates

Google Data Analytics
Google | Coursera
SQL Tableau Data Cleaning
Biostatistics in Public Health
Johns Hopkins University | Coursera
Hypothesis testing Regression P-values
Understanding Research Methods
University of London | Coursera
Medical Research Critical Appraisal
R Programming
Johns Hopkins University | Coursera
Functions R Base Visualization

Academic Profiles