Hi, I'm Harshan.

My day consists of playing with data and applying various machine learning algorithms to analyse datasets. I am skilled in most data science steps: data pre-processing, applying statistical and deep learning methods, data visualization, and picking up new research in different fields.

I am currently a Computer Science undergraduate student at a research institute, the Indian Institute of Information Technology, Guwahati, and am pursuing a parallel BSc degree in Data Science and Programming from IIT Madras (NIRF Rank 1).

I'm looking for opportunities to work with more real-world data.

About Me

2020: JEE Mains & Adv., Top-2 percentile
2020-24: Computer Science & Engineering, IIIT Guwahati
2021-24: Data Science and Programming, IIT Madras
2023: Research & Data Roles, Collaborations (waiting for the opportunity ....)

What Is Data, and Is Data Really Important?

Let me help you understand data better.


I have developed numerous projects, all in pursuit of transforming cutting-edge technology into a product.

Research Projects

Tuning Diffusion Models, Summarization Algorithm & Recommendation Systems.


AI/ML Development

Dr. Cleaner (drone cleaner), Fashion Genie (GenAI), Attendance System, CV projects



Counterfactual generation and end-to-end AutoML (automating the entire DS process) ... ONGOING



From teaching ML/AI at my college and other schools to a garbage-cleaning initiative using drones

Building Human-Centered AI

Artificial Intelligence is increasingly becoming pervasive in all walks of our lives. If we can understand, explain, and reproduce it reliably, then we can build sustainable AI for the future.


A few stories and blogs about statistics, data science, machine learning, big data and more...

Our world will always be linked to data ;)

Data Preparation and Cleaning: The secret ingredient to baking a successful ML cake!

25th Dec 2022 - 5 minutes read

Machine Learning (ML) is a powerful field that allows computers to learn from data and make predictions or decisions without explicit programming. It has applications in various domains, from healthcare to finance to self-driving cars. But before we dive into the world of complex algorithms and predictive models, there's a crucial step that often gets overlooked: data preparation and cleaning.

Why is Data Preparation and Cleaning Essential?
-Harshit Singh

Demystifying the Data Science Role: Prepare everything From Resume to In-Person Interview

2nd Dec 2022 - 5 minutes read

The world of data science is booming, and with it comes a growing demand for skilled data scientists. If you're aspiring to enter this exciting field, you need to navigate the job hunt successfully. In this comprehensive guide, we'll break down each step of the data science job search process, explaining key concepts and offering practical tips for success.

Understanding the Data Scientist Role
-Harshit Singh

Your Ultimate Guide to Writing ChatGPT Prompts for Easier Data Science

20th Feb 2023 - 15 minutes read

In today's data-driven world, data science has emerged as a crucial field for making informed decisions and extracting valuable insights from data. Whether you're a seasoned data scientist, a newbie looking to learn, or someone seeking specific data-related assistance, ChatGPT has got you covered. In this comprehensive guide, we will explore 60 ChatGPT prompts tailored to various data science needs. For each prompt, we'll explain its purpose and provide examples of how it can be used.

1. Train Classification Model
-Harshit Singh

Roadmap to Mastering Data Science and Machine Learning

2nd Dec 2022 - 5 minutes read

Embarking on a journey to master data science and machine learning is an exciting and rewarding endeavor. This comprehensive roadmap is designed to guide you through the essential skills and concepts required to become proficient in these fields. Broken down into 12 sections and spanning a duration of 100 hours, or roughly 2 to 3 months, this roadmap covers everything from Python programming fundamentals to advanced machine learning techniques, data visualization, and even cloud deployment. Let's delve into each section in detail.

Section 1: Python Programming and Logic Building
-Harshit Singh

Scoring Goals or Just Luck in Football? How Hypothesis Tests Make Informed Choices

27th Nov 2022 - 5 minutes read

In the world of statistics, hypothesis tests are powerful tools used to evaluate the validity of assumptions or ideas based on data. Imagine you're the coach of a professional football team, faced with a decision regarding the performance of two players. One player, the new sensation, has a remarkable scoring rate with five goals in two matches. On the other hand, the current star player has a scoring rate of one goal per match but has played 100 matches. You find yourself in a dilemma – is the new player truly a better scorer, or is this just a lucky streak? Hypothesis tests can help you make informed decisions.
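As a rough illustration of the coach's dilemma, here is a minimal sketch of one possible test. It rests on assumptions that are not in the post: goals per match are modelled as Poisson-distributed, and the star player's rate of one goal per match (from 100 matches) is treated as a known baseline. The helper name `poisson_tail` is hypothetical. The question it asks is: if the new player really scored at the baseline rate, how likely would five or more goals in two matches be?

```python
import math


def poisson_tail(k_observed: int, expected: float) -> float:
    """Return P(X >= k_observed) for X ~ Poisson(expected)."""
    # Sum the probabilities of 0 .. k_observed - 1 goals, then take the complement.
    cdf_below = sum(
        math.exp(-expected) * expected**k / math.factorial(k)
        for k in range(k_observed)
    )
    return 1.0 - cdf_below


# Assumption: the star player's long-run rate is the null-hypothesis baseline.
baseline_rate = 1.0   # goals per match
new_goals = 5         # goals scored by the new player
new_matches = 2       # matches played by the new player

# Under the null hypothesis the new player is expected to score this many goals.
expected_goals = baseline_rate * new_matches

# One-sided p-value: chance of seeing at least 5 goals if the null is true.
p_value = poisson_tail(new_goals, expected_goals)
print(f"p-value = {p_value:.4f}")  # prints "p-value = 0.0527"
```

Under these assumptions the p-value is about 0.053, just above the conventional 0.05 threshold, so two matches alone do not clearly separate skill from a lucky streak. A different model or a two-sample test could reasonably give a different answer.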

Common Tests:
-Harshit Singh

Feel free to contact me. For open source projects, please open an issue or pull request on GitHub. If you want to follow me, reach me on LinkedIn. Otherwise, send me an email at harshitsingh14@gmail.com.

