Abhishek Singh, Author at DataSpoof

Explaining the concepts of dimensionality reduction

Post author:Abhishek Singh
Post published:January 20, 2023
Post category:Machine Learning

Dimensionality reduction is a technique that is used for reducing the number of input variables present in the dataset. This is done to ease the task of modeling as the…

8 ways to handle Imbalance data in Python

Post author:Abhishek Singh
Post published:January 17, 2023
Post category:Data Science Machine Learning Python

Hello everybody, I hope everything is going well. We will talk about 8 ways to handle Imbalance data in Python in this blog post and how it impacts the outcome…

9 popular types of data visualization in Python and R

Post author:Abhishek Singh
Post published:January 16, 2023
Post category:Data Science

One of the most important steps in gaining insight into the data is data visualization. It's up to us to decide which of the many tactics and plot types we…

Regular expression in Python

Post author:Abhishek Singh
Post published:January 15, 2023
Post category:Python

A regular expression, sometimes known as "regex," is an effective tool for finding patterns in text. A regex can be used to match a single character or a predetermined group…

How to detect and remove outliers in Python

Post author:Abhishek Singh
Post published:January 13, 2023
Post category:Machine Learning

Outliers are one of the key parts of a dataset that must be removed during cleaning and pre-processing by using feature engineering approaches. Let us understand what outliers is; these…

Machine learning explained in a simple way

Post author:Abhishek Singh
Post published:January 10, 2023
Post category:Machine Learning

Introduction Machine learning is a form of artificial intelligence that helps us build software applications that can make accurate predictions. Without explicitly programming the models, the machine learning algorithms use…

Various types of data compression in MapReduce

Post author:Abhishek Singh
Post published:August 12, 2022
Post category:Big Data

When Hadoop word comes to mind instantly, one more word also comes side by side in mind which is big data. Big data means a very large amount of data.…

How to read different types of files in Hadoop MapReduce

Post author:Abhishek Singh
Post published:August 11, 2022
Post category:Big Data

In Hadoop, we can read different types of files using map-reduce. As different files have different types of formats. We can’t read all in the same manner. So, we will…

How to calculate Median and Standard deviation using MapReduce

Post author:Abhishek Singh
Post published:August 10, 2022
Post category:Big Data

MapReduce is a programming model used to perform data analysis on large amounts of data in a scalable manner without any data loss. We can perform many different types of…

How to find an inverted index of the sample using MapReduce

Post author:Abhishek Singh
Post published:August 9, 2022
Post category:Big Data

MapReduce is a programming model used to perform a different type of analysis on a large amount of data. Here we are using MapReduce to generate an inverted index on…