Data Analytics Engineer & AI Product Strategist (AI product developer) with hands-on experience developing and implementing full-stack Generative AI applications with Tesseract OCR, OpenAI, LangChain, Streamlit, Pandas, SQLite, and Google Sheets API. Created and implemented a lot of AI-driven web applications that tackle real-world problems in the fields of natural language processing, healthcare, finance, and human resources. Competent in Big Data, Azure Cloud, Power BI, SQL, Python, and Databricks; passionate about low-code AI development and data engineering. Adept at automating intricate data workflows, building interactive dashboards, and extracting useful information from sizable, disorganised datasets to aid in strategic decision-making. Experienced in using Azure Data Factory to create reliable ETL pipelines, Databricks and PySpark to process and analyse massive amounts of data, and data architecture optimisation for dependability, performance, and profitability.
This portfolio is a reflection of my journey — showcasing my work, my approach to problem-solving, and the creative spirit I bring to every project.
Led end-to-end dashboard projects for clients using Power BI, SQL, and Python, automating reports to deliver real-time business insights and KPI monitoring. Built data models and DAX measures to identify market trends, customer behavior, and operational inefficiencies for clients in e-commerce and finance.
Built the AI Market Analyst, a real-time stock and news intelligence platform that uses natural language querying, portfolio tracking, and GPT-4-powered summaries—making financial insights accessible to non-experts. Created impactful tools like the Resume Analyzer, Medical Report Analyzer, and Weather ETL Dashboard—each solving real-life problems using GenAI, data engineering, and visual analytics to simplify complex decision-making
1. Built a Streamlit app that lets users upload medical PDFs/images, extracts diagnostic values using Tesseract OCR, and identifies abnormal parameters. 2.Generates doctor-style impressions using GPT-4 and visualizes health trends with Altair graphs for longitudinal insights. 3.Integrated Google Sheets API for authentication, user tracking, and enforcing per-user usage limits with real-time login stats
1.Developed an AI-powered career tool where users upload resumes and job descriptions to get match scores, gap analysis, and improvement suggestions via GPT-4. 2.Supports PDF extraction with PyPDF2, vector-based similarity matching, and AI summarization of candidate-job alignment. 3.Streamlit UI includes email/password login with Google Sheets storage, daily usage tracking, and user-specific dashboards.
1.Created a personal stock analyst tool where users can ask GPT-4 natural-language questions about real-time stock data and related global news. 2.Uses Yahoo Finance and NewsAPI to fetch and analyze live market prices and headlines, then generates smart summaries with GPT. 3.Includes Google Sheets-based user management, portfolio tracking (up to 5 stocks), and a dashboard with charts, export to CSV, and usage limits.
1.Built a modular ETL pipeline using Python scripts to extract, transform, and store live weather data into a SQLite database. 2.A Streamlit dashboard allows dynamic city search, visualizes real-time weather metrics with charts, and optionally saves entries to the database. 3.Includes support for running the ETL via terminal and managing historical city weather logs from a local UI.
1. Applied advanced data analysis techniques, including time series analysis and accurate sales forecasting, to uncover insights that support strategic decision-making and growth planning. 2. Designed interactive sales dashboards in Power BI with real-time visualizations, filters, slicers, and region-wise map charts to drive data-informed business success.
1. Built interactive dashboards to analyze online sales data using advanced filters, slicers, and user-driven parameters for dynamic insights. 2. Performed data modeling and customisation, including complex joins, calculations, and diverse visualizations like bar, pie, doughnut, line, area, scatter charts, and maps
A real-time data pipeline that captures streaming data from IoT or social media feeds using Azure Event Hubs. Data is processed with Azure Databricks, stored in Delta Lake, and visualized through Power BI.
Powerful language for data analysis, automation, AI, and application development with vast libraries like Pandas and Scikit-learn.
Link to courseUsed to query and manage structured data in relational databases like MySQL, PostgreSQL, and SQL Server.
Link to courseA Microsoft tool for building interactive dashboards and business reports to uncover insights from data.
Link to courseTransforming raw data into actionable insights using tools like matplotlib, seaborn, and Plotly.
Link to courseDesigning, building, and maintaining scalable data pipelines for large-scale data processing and storage.
Link to courseETL (Extract, Transform, Load) is used to collect, clean, and organize raw data into meaningful business formats.
Link to courseA unified data platform for big data analytics, combining Apache Spark with Azure cloud services.
Link to courseCloud-based ETL and data integration tool that orchestrates and automates data movement and transformation.
Link to coursePython API for Apache Spark, used to process large-scale datasets with distributed computing capabilities.
Link to courseCrafting optimized prompts to guide AI tools like ChatGPT to generate accurate, task-specific outputs.
Link to courseBuilds AI-powered applications quickly using no-code and low-code platforms like Bubble, Glide, and Make.com.
Link to courseBuilds AI-powered applications quickly using no-code and low-code platforms like Bubble, Glide, and Make.com.
Link to course