top of page


How Semantic Search is Transforming the Way We Find Information
This article explores how the search functionality used to work in the past and how it has been revolutionalized using the semantic search a
shivamshinde92722
Jan 244 min read


Classifying the Unstructured: A Guide to Text Classification with Representation and Generative Models
This article will delve into the various methodologies to perform text classification using transformer-based models, explaining their princ
shivamshinde92722
Jan 153 min read


From Words to Vectors: Exploring Text Embeddings
This article will guide you through the various techniques for transforming text into formats that machines can understand.
shivamshinde92722
Jan 116 min read


Beyond Labels: The Magic of Autoencoders in Unsupervised Learning
In a world where labeled data is often scarce, autoencoders provide a powerful solution for extracting insights from unstructured data
shivamshinde92722
Oct 10, 20246 min read


Efficient Backpropagation: Exploring the Role of Jacobian Matrices and the Chain Rule
In this article, I will tough upon the topic of Jacobian matrix and how it is used in the back-propagation operation of deep learning.
shivamshinde92722
Oct 9, 20242 min read


The Tug of War: Accuracy and Interpretability in Machine Learning Models
Model prediction accuracy is crucial, but not always the top priority. Sometimes, we may need to sacrifice some accuracy to make the model m
shivamshinde92722
Sep 12, 20242 min read


Unlocking the Potential of Pandas: Must-Know Methods for Efficient Data Handling in Python
In this article, you will learn the very important and must-know methods present in the Python Pandas library.
shivamshinde92722
Feb 9, 20245 min read


From Good to Great: Elevating Model Performance through Hyperparameter Tuning
This article will explain the concept of hyperparameter tuning and the different methods that are used to perform this tuning, and their imp
shivamshinde92722
Jan 27, 20246 min read


Guardrails for ML and DL Models: A Deep Dive into Regularization Techniques
This article will explain the concept of regularization in ML and DL. The article will also introduce readers to some of the most famous reg
shivamshinde92722
Jan 23, 20247 min read


Pause for Performance: The Guide to Using Early Stopping in ML and DL Model Training
This article will explain the concept of early stopping, its pros and cons, and its implementation using Scikit-Learn and TensorFlow.
shivamshinde92722
Jan 20, 20247 min read


Data Reliability 101: A Practical Guide to Data Validation Using Pydantic in Data Science Projects
This article will explain Why data validation is needed for the Python code, How it’s done using the Pydantic library, and How to integrate
shivamshinde92722
Jan 12, 20247 min read


A Step-by-Step Guide to Building an End-to-End Machine Learning Project
This article will show you the way you can create an end-to-end machine learning project.
shivamshinde92722
Jan 7, 20244 min read


From Raw to Refined: A Journey Through Data Preprocessing — Part 6: Imbalanced Datasets
This article will explain the concept of imbalanced datasets and the methods used to handle them.
shivamshinde92722
Jan 6, 20245 min read


From Raw to Refined: A Journey Through Data Preprocessing — Part 5: Outliers
A Simple Guide to Navigating Data Anomalies. Decode the mystery behind outliers in data science. From detection to resolution, empower your
shivamshinde92722
Aug 31, 20236 min read


From Raw to Refined: A Journey Through Data Preprocessing — Part 4: Data Encoding
Humans can understand textual information. However, this is not the case for machines and any algorithms that machines run. Machines and alg
shivamshinde92722
Aug 30, 20235 min read


From Raw to Refined: A Journey Through Data Preprocessing — Part 3: Duplicate Data
This article will explain how to identify duplicate records in the data and, the different ways to deal with the problem of having duplicate
shivamshinde92722
Aug 27, 20234 min read


From Raw to Refined: A Journey Through Data Preprocessing - Part 2: Missing Values
This article will explain the concept of missing values in the data and the ways to deal with data containing missing values.
shivamshinde92722
Aug 5, 20236 min read


From Raw to Refined: A Journey Through Data Preprocessing — Part 1: Feature Scaling
This article is the part 1 of the Data Preprocessing series. In this part, I explain the feature scaling step of the preprocessing.
shivamshinde92722
Jul 29, 20233 min read


Be Confident in your Machine Learning Models with the help of Cross-Validation
Cross-validation is a go-to tool to check if your machine-learning model is reliable enough to work on new data. This article will discuss c
shivamshinde92722
Jul 22, 20234 min read

From Chaos to Order: Harnessing Data Clustering for Enhanced Decision-Making
This article will show the important use cases of data clustering methods, how to use these methods, and also show how one can use these met
shivamshinde92722
Jul 15, 20236 min read

From Many to Few: Tackling High-Dimensional Data with Dimensionality Reduction in Machine Learning
This article will discuss the curse of dimensionality in machine learning problems and dimensionality reduction as a solution for the...
shivamshinde92722
Jul 2, 20234 min read


Unlocking the Potential of Text: A Closer Look at Pre-Embedding Text Cleaning Methods
This article will discuss different cleaning techniques that are essential to obtain maximum performance from textual data
shivamshinde92722
Jun 29, 20233 min read

Striking the Right Balance: Understanding Underfitting and Overfitting in Machine Learning Models
This article will explain the basic concept of overfitting and underfitting from the machine learning and deep learning perspective.
shivamshinde92722
Apr 14, 20234 min read

Transform Your Data Science Project: Discover the Benefits of Storing Variables in a YAML File
This blog post will discuss the benefits of using a YAML file as a central repository for storing variables, parameters, and hyper-parameter
shivamshinde92722
Jan 27, 20235 min read
bottom of page