Hi, I'm Srishti Kumari

B.Tech CSE (Data Science) • Haldia Institute of Technology • Building ML Solutions for Tomorrow

Machine Learning
Data Analysis
AI Solutions

About Me

Hello! I'm Srishti Kumari, a passionate Data Science and Machine Learning enthusiast currently pursuing my B.Tech in Computer Science and Engineering (Data Science) at Haldia Institute of Technology. My journey into the world of data began with curiosity about how algorithms can uncover patterns and drive intelligent decision-making.

With a strong foundation in Python, SQL, and Machine Learning frameworks, I specialize in building end-to-end ML solutions—from data preprocessing and exploratory analysis to model deployment. My experience spans predictive modeling, recommendation systems, and data visualization, with a focus on solving real-world problems through data-driven insights.

I'm particularly excited about applying ML to impactful domains like prediction systems, recommendation engines, and analytical dashboards. Whether it's engineering features for better model performance or crafting visualizations that tell compelling data stories, I thrive on transforming raw data into actionable intelligence.

My Mission

To build impactful AI solutions that drive real-world value through predictive analytics, intelligent automation, and data-driven decision-making. I'm committed to continuous learning and staying at the forefront of ML innovation.

10+

ML Projects

100+

GitHub Commits

7.87

YGPA

Active

Community Member

Technical Skills

Organized by proficiency level and continuously expanding

Proficient

Python

★★★★★

SQL

★★★★★

Pandas

★★★★★

NumPy

★★★★★

Scikit-learn

★★★★★

Matplotlib

★★★★★

Seaborn

★★★★★

Data Cleaning

★★★★★

EDA

★★★★★

Feature Engineering

★★★★★

Supervised Learning

★★★★★

Intermediate

SQL

★★★★☆

Git

★★★★☆

GitHub

★★★★☆

MySQL

★★★★☆

HTML/CSS

★★★★☆

Model Evaluation

★★★★☆

Java

★★★☆☆

Currently Learning

TensorFlow

Deep learning framework for neural networks and advanced ML models

PyTorch

Flexible deep learning library for research and production

Flask/FastAPI

ML model deployment and APIs

Soft Skills

Problem Solving

Demonstrated through end-to-end ML projects tackling real-world challenges

Communication

Public Relations Member at DSCH, leading outreach and event coordination

Collaboration

Active team player in technical clubs and group projects

Continuous Learning

Constantly expanding skill set in emerging ML/AI technologies

Featured Projects

End-to-end Machine Learning solutions demonstrating technical depth and practical impact

Titanic Survival Prediction

Problem: Binary classification challenge to predict passenger survival on the Titanic using demographic and ticket information.

Dataset: Kaggle Titanic dataset (891 training samples, 418 test samples)

Methodology & Results

  • Feature Engineering: Created family_size, title extraction from names, fare_per_person
  • Models: Logistic Regression (78%), Decision Tree (82%), Random Forest (85%)
  • Techniques: Cross-validation, hyperparameter tuning with GridSearchCV
  • Best Model: Random Forest with 85% accuracy on validation set
Python Scikit-learn Pandas Classification

Movie Recommendation System

Problem: Content-based recommendation system to suggest similar movies based on plot descriptions, genres, and keywords.

Dataset: TMDB 5000 Movie Dataset with metadata and credits

Methodology & Results

  • NLP Preprocessing: Tokenization, stemming, stopword removal on plot overviews
  • Vectorization: TF-IDF vectorization of text features (3000+ features)
  • Similarity: Cosine similarity matrix for content-based filtering
  • Output: Top-5 movie recommendations with similarity scores
Python NLP TF-IDF Cosine Similarity

COVID-19 India Data Analysis

Problem: Comprehensive exploratory data analysis of COVID-19 trends across Indian states to identify patterns and insights.

Dataset: Daily COVID-19 cases, deaths, and recoveries by state (2020-2021)

Methodology & Results

  • Data Cleaning: Handled missing values, inconsistent date formats, data aggregation
  • Time-Series Analysis: Trend analysis, moving averages, growth rate calculations
  • Visualization: Interactive plots showing state-wise comparisons, peak periods
  • Insights: Identified hotspot regions, recovery patterns, vaccination impact
Python Pandas Matplotlib Seaborn

Education & Certifications

B.Tech in Computer Science & Engineering (Data Science)

Haldia Institute of Technology (Autonomous)

2023 - 2027

YGPA: 7.87/10

Relevant Coursework
Machine Learning Artificial Intelligence Data Structures & Algorithms Database Management Systems Probability & Statistics Linear Algebra Operating Systems Data Mining Big Data Analytics Computer Networks
  • Strong foundation in mathematical concepts essential for ML/AI
  • Hands-on experience with modern data science frameworks and tools
  • Project-based learning with real-world problem-solving

Certifications & Online Learning

Machine Learning Specialization

Ongoing - Coursera

Python for Data Science

Completed

SQL for Data Analysis

Completed

Git & GitHub

Completed

Achievements & Extracurriculars

Public Relations Member

Data Science Club - DSCH, HIT

Leading outreach initiatives, organizing technical workshops, and coordinating events to promote data science awareness on campus. Responsible for community engagement and building connections with industry professionals.

  • Coordinated 5+ technical workshops and guest lectures
  • Managed social media presence and content strategy
  • Facilitated collaboration between students and industry mentors

Active Member

Nivedita Club, HIT

Contributing to social welfare initiatives and community development programs while balancing technical pursuits with social responsibility.

Active GitHub Contributor

Open Source Enthusiast

Maintaining a consistent GitHub presence with 1000+ commits, showcasing ML projects and contributing to the data science community.

View GitHub Profile

Technical Content Creator

Blog & Documentation

Writing technical articles to share knowledge and explain complex ML concepts in accessible language (see Blog section).

Get In Touch

I'm actively seeking internships and entry-level opportunities in Data Science and Machine Learning. Let's connect!

Contact Information

LinkedIn

Connect with me

GitHub

@Srishtik-ui

Location

Haldia, West Bengal, India

Send a Message