Ayush Sood
Analyst Portfolio

Data & Project Management Professional • AI Analytics • Digital Transformation


I am currently open to freelance projects and collaborations.

I help organizations turn data into action. With over five years of experience in analytics, AI-driven process automation, and project management, I specialize in delivering measurable results—from cutting costs and boosting engagement to building dashboards and AI solutions that scale. My work spans education, research, and manufacturing, blending technical skill with cross-functional leadership to drive business impact.

Ayush Sood Avatar
Python
Python
  • Pandas
  • Numpy
  • Matplotlib
  • Seaborn
  • Scikit-learn
Excel
Excel
  • Pivot Tables
  • VLOOKUP/XLOOKUP
  • Power Query
  • Dashboards
  • Conditional Formatting
SQL
SQL
  • Joins & Aggregation
  • Window Functions
  • CTEs & Subqueries
  • Data Cleaning
  • Query Optimization
Tableau
Tableau
  • Interactive Dashboards
  • Calculated Fields
  • Storytelling
  • Data Blending
  • Extracts & Publishing
R
R
  • dplyr
  • ggplot2
  • tidyr
  • Statistical Modelling
×
Download CV Contact Me LinkedIn View Projects
GitHub: @Ays0172 LinkedIn: @AyushSood

Netflix Data Analysis: Movies vs TV Shows

Overview:
Explored Netflix’s global library to uncover trends and insights comparing Movies and TV Shows using Python, pandas, and matplotlib. Analyzed over 8,000 titles to answer questions like: Are there more movies or TV shows on Netflix? What are the most common ratings? Which countries dominate the catalog?
Key Steps & Features:
  • Loaded, cleaned, and preprocessed the Netflix Titles dataset (8,000+ entries)
  • Exploratory Data Analysis: missing values, inconsistent formats, deduplication
  • Visualized content rating distribution, movie duration histogram, and top 10 contributing countries
  • Plotted yearly trends for movies vs TV shows with custom charts
  • Summarized key insights on viewing patterns and regional dominance
Impact:
Delivered clear, actionable insights: Movies outnumber TV shows, most content is for mature audiences, and the US & India dominate the catalog. The analysis makes it easy for business, data, and content teams to spot trends and strategize content acquisition.
Netflix Data Analysis Visualizations

Revenue Report – Power BI

Revenue Report Preview

This interactive Sales and Revenue Dashboard, built with Microsoft Power BI, provides a comprehensive analysis of business performance across multiple dimensions. It visualizes monthly revenue trends, retailer-type contributions, and country-wise sales distribution. The report includes detailed COGS tracking, total revenue and cost breakdowns, profit analysis, and dynamic KPIs for rapid decision-making. With category-level insights into camping, mountaineering, and outdoor protection products, this dashboard delivers actionable intelligence for sales optimization.

AI-Powered Customer Churn Analytics – Lloyds Bank Internship (Forage Simulation)

In this project, I analyzed multi-source customer data from SmartBank (a Lloyds Banking Group subsidiary) to deliver actionable, AI-powered insights for customer retention.

Key Steps:

  • Integrated demographic, transactional, service, and digital engagement data into a unified analytics dataset
  • Explored and visualized key churn signals using Python, pandas, matplotlib, and seaborn
  • Engineered features and addressed data quality issues: missing values, inconsistent formats, outliers
  • Built and evaluated a Random Forest model to predict customer churn, achieving a ROC-AUC of 0.82
  • Interpreted feature importances and presented results in a corporate-style, infographic report
  • Employee Data Cleaning using Python

    Employee Data Cleaning Project Screenshot

    Tackled a real-world HR dataset of 15,000 employees, transforming messy, inconsistent raw data into a clean, analytics-ready format. Leveraged Python, pandas, and numpy to handle missing values, fix data types, remove duplicates, address outliers, and ensure robust salary data. The end result: a high-quality dataset ready for insightful HR analytics and machine learning.

    Key Steps:

    • Imported, audited, and profiled raw CSV data
    • Fixed inconsistent data types and handled missing values smartly
    • Removed duplicates and corrected impossible values (e.g., negative salaries)
    • Detected and removed outliers for salary and other numeric fields
    • Delivered a clean, well-structured dataset suitable for dashboards, reporting, or AI models

    Impact:
    Enabled the HR team to trust their analytics, automate reporting, and extract actionable workforce insights—while saving countless analyst hours on manual data fixes.

    Learning Pandas for Data Science

    Tackled a real-world HR dataset of 15,000 employees, focusing on transforming messy, inconsistent raw data into a clean, analytics-ready format. Using Python, pandas, and numpy, I handled missing values, fixed data types, removed duplicates, addressed outliers, and ensured robust salary data. The result: a reliable, high-quality dataset ready for insightful HR analytics and machine learning.

    Key Steps:

  • Imported, audited, and profiled raw CSV data
  • Fixed inconsistent data types and handled missing values smartly
  • Removed duplicates and corrected impossible values (e.g., negative salaries)
  • Detected and removed outliers for salary and other numeric fields
  • Delivered a clean, well-structured dataset suitable for dashboards, reporting, or AI models
  • Learning Numpy for Data Science

    Mastered the fundamentals of NumPy by working with real-world numerical datasets, focusing on efficient data manipulation and computation. Leveraged NumPy arrays to accelerate calculations, perform complex transformations, and enable high-performance analytics—laying a strong foundation for scientific computing and machine learning in Python.

    Key Steps:

  • Loaded and explored numerical data using NumPy arrays
  • Applied array slicing, indexing, and reshaping for flexible data manipulation
  • Performed statistical analysis and aggregations directly on arrays
  • Implemented mathematical operations and vectorized computations for speed
  • Utilized broadcasting and advanced functions for complex data processing tasks
  • Learning Matplotlib for Data Science

    Developed strong proficiency in Matplotlib by visualizing real-world datasets and transforming raw numbers into compelling, insightful graphics. Utilized Matplotlib’s versatile plotting capabilities to create a range of charts and visualizations—enabling clear data storytelling and deeper analytical insights for scientific computing and machine learning projects in Python.

    Key Steps:

  • Designed and customized plots including line graphs, bar charts, scatter plots, and histograms
  • Integrated Matplotlib with NumPy for seamless data-to-visualization workflows
  • Enhanced readability through thoughtful use of labels, legends, titles, and annotations
  • Applied subplots and figure customization for multi-dimensional data exploration
  • Exported publication-quality graphics for reports, presentations, and dashboards
  • Stunning Data Visualizations with Seaborn

    Mastered the art of data storytelling using Seaborn—Python’s most stylish and intuitive statistical visualization library. Explored real-world datasets to craft beautiful, informative graphics and unlock deeper analytical insights. Leveraged Seaborn’s high-level API and powerful aesthetics to make data pop in scientific computing, analytics, and machine learning projects.

    Key Steps:

  • Created rich statistical plots including barplots, violin plots, heatmaps, scatterplots, and pairplots
  • Customized color palettes (e.g., palette="GnBu", "pastel") for visual impact and clarity
  • Combined Seaborn with Pandas for seamless data manipulation and visualization workflows
  • Enhanced visualizations with hue, style, and facet grids for multi-dimensional analysis
  • Generated publication-ready plots for reports, presentations, and dashboards
  • Ticketing System using PHP on XAMPP server

    A simple PHP & MySQL ticketing system for helpdesks and IT support. Submit, track, and manage support tickets via your browser. Easy setup—just run on XAMPP. Includes admin panel, ticket status, and user management. Perfect for students or small teams.

    Blackcoffer Web Data Wrangling & Cleaning

    Extracted and cleaned textual information from Blackcoffer project web pages using Python web scraping techniques. Automated the process of crawling, parsing HTML, and transforming unstructured content into structured, analytics-ready datasets. This project enabled efficient content analysis, streamlined data preparation for NLP tasks, and provided clean data for deeper business insights.