IntelliAnalyst-Automating Data Analysis Through Natural Language InteractionID: 2705 Abstract :IntelliAnalyst Is An Innovative Open-source AI Agent That Streamlines Data Analysis Workflows, Making Advanced Analytics Accessible To Users Without Specialized Expertise. Powered By Large Language Models (LLMs), It Automates The Full Pipeline Starting From Raw Data Upload: Intelligently Detecting Target Variables, Managing Null Values Via Strategies Like Mean/median Imputation Or Interpolation, Applying Contextaware Encoding (e.g., One-hot, Label), Performing PCA-based Dimensionality Reduction, Resolving Duplicates, And Balancing Classes With Methods Such As SMOTE Or ADASYN. By Simply Selecting An Analysis Mode—classification, Regression, Or Clustering—users Trigger LLM-driven Recommendations For Optimal Dataset Splits, Model Selection (e.g., Random Forest, XGBoost, K-means), And Hyperparameter Tuning, Eliminating Manual Intervention. The System Delivers Real-time Model Training, Evaluation Metrics (e.g., F1-score, Silhouette Coefficient, RMSE), And Interactive Visualizations Including Confusion Matrices, ROC Curves, 3D Scatter Plots, Heatmaps, And Word Clouds—all While Prioritizing Data Privacy Through One-time Processing Without Storage. Benchmarks Across Diverse Datasets Show IntelliAnalyst Accelerates End-to-end Analysis By Up To 10x Over Traditional Tools, Yielding Competitive Performance At Minimal Cost (~$0.002 Per Run With GPT-4o). This Agent Bridges The Expertise Gap, Enabling Rapid Insight Generation For Researchers, Analysts, And Domain Experts Alike. |
Published:15-4-2026 Issue:Vol. 26 No. 4 (2026) Page Nos:2334 - 2339 Section:Articles License:This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. How to CiteK. Thanmayee Reddy,V. Harshitha,B. Pravalika Reddy,Dr.Md Jaffer Saddiq,Dr.Naadem Divya, IntelliAnalyst-Automating Data Analysis through Natural Language Interaction , 2026, International Journal of Engineering Sciences and Advanced Technology, 26(4), Page 2334 - 2339, ISSN No: 2250-3676. |