Please download main.R, ui.R, utils.R, preprocessing.R, server.R files to run the program
P.S. All other files are our previous codes
This R Shiny application is a powerful, interactive tool designed to simplify data analysis, cleaning, visualization, and machine learning—all in one seamless workflow. It is ideal for data enthusiasts, researchers, and students who want to analyze any dataset without writing complex code.
-
🔍 Dataset Overview
- Upload any
.csvfile - Automatically displays dataset summary, column names, structure, missing values, and duplicates
- Upload any
-
🧹 Data Cleaning
- Detect and handle missing values
- Remove duplicate rows
- Identify and optionally remove outliers manually or automatically
-
📊 Visualizations
- Histogram, Boxplot, Scatterplot
- Correlation heatmap
- Pair plots
- Dynamic selection of variables for customized plotting
-
📈 Outlier Detection
- Boxplot-based outlier detection
- Visual interface to remove or retain outliers
-
🤖 Machine Learning Models
- Automatically runs multiple ML models (e.g., Linear Regression, Decision Tree, Random Forest, etc.)
- Compares models based on accuracy, RMSE, and other relevant metrics
- Highlights the best-performing model
-
🔮 Prediction Engine
- Interactive UI for users to select the target variable
- Automatically detects whether the prediction is regression or classification
- Uses the best-performing model for final prediction
- Allows custom input values for prediction
- Language: R
- Framework: Shiny
- Packages Used:
dplyr- Data ManipulationGGally- Better Visualizations (pair plots)corrplot- Correlation Matricestidyr- Data reshapingrandomForest- Random Forest ML Modelxgboost- Extreme Gradient Boosting ML Powerhousee1071- Support Vector Machinegbm- Gradient Boosting Machineearth- Multivariate Adaptive Regression Splinesggplot2– Visualizationplotly– Interactive plotsDT– Data tablescaret– Model training and evaluationshinyWidgets– UI enhancements