What is the difference between Data Analytics and Business Analytics?

What is the difference between Data Analytics and Business Analytics?

Most Business Managers use Data Analytics and Business Analytics terms interchangeably. However, there is subtle difference between these terms and subtleties are based on the end usage of the...
Handling Date and Time data in R – Part 2

Handling Date and Time data in R – Part 2

This is the second article of two part article series on handling date and time data in R. You can read the first article here. When date and time data are imported into R they will often default to a...
Handling Date and Time data in R – Part 1

Handling Date and Time data in R – Part 1

A time series is a list of observations ordered successively over time. In a time series, observations are often recorded at evenly-spaced intervals, such as daily, weekly, monthly, and so on. This is...
String Manipulation in R – turning text into useful form for analysis

String Manipulation in R – turning text into useful form for analysis

String manipulation refers to the process of modifying, analyzing, or transforming strings, into useful form for analysis. This is a fundamental aspect of programming and is used in various...
Mastering Text Output in R: From Basics to Advanced Techniques

Mastering Text Output in R: From Basics to Advanced Techniques

Character vectors in R are used to store text data. R also provides a variety of built-in functions to deal with character vectors. Perhaps the most basic thing we can do with texts is to view them. R...
Unlocking Insights: Visualizing Data with Heatmaps for Clearer Correlations

Unlocking Insights: Visualizing Data with Heatmaps for Clearer Correlations

Heatmaps are used to show relationships between two variables, one plotted on each axis. By observing how cell colors change across each axis, you can observe if there are any patterns in value for...
A Beginner’s guide to Line charts in R

A Beginner’s guide to Line charts in R

A line chart, connects a series of data points using a line. Line chart also displays sequential values to help you identify trends. In business, Line chart is widely used to display the time series...
Get introduced to R’s apply Family: Your Guide to apply( ), lapply( ), sapply( ), and tapply( )

Get introduced to R’s apply Family: Your Guide to apply( ), lapply( ), sapply( ), and tapply( )

In R, for loop is used to repeat evaluating an expression with an iterator on list or vector. In practice, for loop is almost the last choice because an alternative way is much cleaner and easier to...
Understanding Loop expressions in R

Understanding Loop expressions in R

when starting to use R, most R users use loops whenever they need to iterate over elements of vector, list or data frame. For e.g. if we are interested in the mean of all the numeric columns of the...
Anatomy of a box-plot and how to create it in R

Anatomy of a box-plot and how to create it in R

Histograms and Density Plots show the distribution of the data. We can also show the distribution of a data using few critical points in the data set. These critical points are called as quantile...
Four distinct Bar Plot representations in R

Four distinct Bar Plot representations in R

Bar charts are used to show the comparison of numeric values of a categorical variable. For e.g. sale of different products in financial year. Here Sale is numeric value and product name is...
How to draw histogram in R

How to draw histogram in R

Frequency histograms are useful when you want to get an idea about the distribution of values in numeric variable. The hist() function takes a numeric vector as its main argument. To construct a...
How to generate scatter plot in R?

How to generate scatter plot in R?

Summarizing your data, either numerically or graphically, is an important component of any data analysis. Fortunately, R has excellent graphics capabilities and can be used whether you want to produce...
How to import Excel file into R?

How to import Excel file into R?

In business, perhaps the most widely used file format to store the data is the excel workbook. An excel workbook contains tables which can easily perform calculations. Just as CSV file can be imported...
How to import and export CSV file in R?

How to import and export CSV file in R?

The first step in any kind of data analysis in R is to load the data, that is, to import a dataset into the R Environment. There are variety of data files you can import into R, which includes: Among...
Need HELP in R? A sneak peak into R’s Help ecosystem.

Need HELP in R? A sneak peak into R’s Help ecosystem.

When you wanted to import the text file in R, you must have used read.table() function. But do you know that you can pass on more than 20 different arguments to read.table() function? You must have...
Learn 6 easy functions from base R to spot check your data set

Learn 6 easy functions from base R to spot check your data set

When the new data is presented to you for the analysis, you would like to get first hand information on the data set before diving deep into analysis. This is similar to doing a warm up exercises...
6 Fallacies of Data Mining

6 Fallacies of Data Mining

When I speak to beginner data analysts, I could hear few miss-beliefs about data mining tools. I am writing this article to debunk some data mining fallacies. Here are the list of six Data Mining...
What is t-distribution? – An intuitive understanding using business example

What is t-distribution? – An intuitive understanding using business example

In my last article we learned how to estimate population mean by using sample mean when population standard deviation is known. In most instances, the population standard deviation will be unknown and...
What is the difference between Point estimation and interval estimation?

What is the difference between Point estimation and interval estimation?

What is point estimation and what is its fundamental drawback? The use of single sample value such as X_bar (sample mean value) to estimate the population value is known as point estimation, because...
What are different stages of Data Science work? A high-level overview of the process of data science.

What are different stages of Data Science work? A high-level overview of the process of data science.

In this article we will have high-level overview of the process of data science. We will look at different stages of data science work. The process of solving a data science problem is summarized in...
How to remove skewness in the data? Learn 6 powerful data transformations.

How to remove skewness in the data? Learn 6 powerful data transformations.

Many statistical tests assume that the data is normally distributed. Hence if the underlying data is not normal, we need to transform a data to make it near normal before we apply these tests. The...
An intuitive understanding of Sampling distribution and central limit theorem using practical example

An intuitive understanding of Sampling distribution and central limit theorem using practical example

The basic idea of inferential statistics is to use a statistic (mean,Standard Deviation etc.) calculated on a sample in order to estimate a parameter of a population (mean,Standard Deviation etc.)...
A brief introduction to Linear Transformation

A brief introduction to Linear Transformation

In data analysis, we may obtain greater insight while expressing a variable in different form. For e.g. you could use different scale to better visualize the variables that have points close by. Many...
Is your data Normal? A gentle introduction to Normality Tests

Is your data Normal? A gentle introduction to Normality Tests

There are range of techniques that you can use to check if your data sample deviates from a Gaussian distribution, called normality tests. In this tutorial, we will learn some techniques that can be...
A gentle introduction to Sampling terms and definitions – A pre-requisite to inferential statistics.

A gentle introduction to Sampling terms and definitions – A pre-requisite to inferential statistics.

In this tutorial we will get comfortable with some of the commonly used terms from the field of Sampling theory The clarity on these terms is required to understand the inferential statistical...
An intuitive understanding of Normal Distribution using business example

An intuitive understanding of Normal Distribution using business example

In this tutorial, we will have gentle introduction to normal distribution with real world example. We will generate normal distribution plot in R and learn some R functions to calculate the...
An intuitive real life example of Poisson distribution and how to simulate it in R?

An intuitive real life example of Poisson distribution and how to simulate it in R?

In this tutorial we will look at Poisson distribution characteristics, build Poisson distribution formula and look at some R functions to calculate the probability of occurrence using Poisson...
An intuitive real life example of a binomial distribution and how to simulate it in R

An intuitive real life example of a binomial distribution and how to simulate it in R

In this tutorial, we will understand the assumptions of binomial distribution, take a business example of binomial distribution, build the binomial distribution formula and use R to solve the problem...
A gentle introduction to covariance and correlation

A gentle introduction to covariance and correlation

In this tutorial we will look at two measures of relationship between two numeric variables: the covariance and coefficient of correlation...
How to calculate and visualise the distribution of Numeric Variable?

How to calculate and visualise the distribution of Numeric Variable?

More commonly used dispersion measures in statistics are variance and standard deviation. These measures give summary statistics, hence does not tell much about the overall data. A five number summary...
A gentle introduction to descriptive statistics – examining univariate distributions

A gentle introduction to descriptive statistics – examining univariate distributions

Descriptive statistics is used to describe the data. A first step in this process is to check the distribution of values of each numeric variable. In this tutorial, various tools for describing the...
What is Statistics and why it is important for data analysis?

What is Statistics and why it is important for data analysis?

In this post you will discover why statistics is important in general and for data science in particular, and types of methods that are available...
What are R Packages and how to install them?

What are R Packages and how to install them?

R packages are extensions to the R programming language. R packages contain code, data, and documentation in a standardized collection format that can be installed by users of R, typically via a...
CRISP-DM and KDD methodology – a structured approach to planning a data analytics project.

CRISP-DM and KDD methodology – a structured approach to planning a data analytics project.

Machine learning, knowledge discovery from data and related areas experienced strong development in the 1990s. Both in academia and industry, the research on these topics was advancing quickly...
Unraveling data types in R – The list – A gentle introduction

Unraveling data types in R – The list – A gentle introduction

In R, a list is an ordered collection of objects, like vector, but lists can actually combine objects of different types. List elements can contain any type of object that exists in R...
What is reproducible data science project?

What is reproducible data science project?

Think back your first chemistry or biology lab course. As you entered into a lab, what was the first thing that were taught? It was not Chemistry or Biology. For most of us, the first instruction in...
Unraveling R Data Types – The Factors – A gentle introduction

Unraveling R Data Types – The Factors – A gentle introduction

The factor data type is used to represent character data. This character data, however takes a small number of distinct values. Each distinct value is represented by a integer code, which is called as...
Unraveling R data types – The Data Frame – A gentle introduction

Unraveling R data types – The Data Frame – A gentle introduction

A data frame represents a data with a number of rows and columns. Unlike matrix, data frames can contain variables with different data types, therefor Data Frames are heterogeneous...
Unraveling R data types – The Matrix – A gentle introduction

Unraveling R data types – The Matrix – A gentle introduction

In R, matrix is a vector with two additional attributes, the number of rows and the number of columns. Since vectors are the building blocks of matrices, like vectors, matrices are also constrained to...
Unraveling R data types – The Vectors – A gentle introduction

Unraveling R data types – The Vectors – A gentle introduction

The first step in learning R programming is getting familiar with basic R objects and their structure. The fundamental data object in R is a vector. In this article we will define the R objects...
Getting started with R – Part 2: RStudio installation guide

Getting started with R – Part 2: RStudio installation guide

This is the second post in the “Getting Started with R Programming” series. In the previous post, we discussed the processes for getting the R programme from the CRAN website. We also...
Getting started with R – Part 1: R installation guide

Getting started with R – Part 1: R installation guide

In this blog we will install R (for Windows and Mac OS) and have a quick tour of R environment...

What R is and what it is not?

R, as a programming language, has been evolving and developing over the last 20 years. Its goal is quite clear to make it easy and flexible to perform comprehensive statistical computing, data...
Why data science workflow requires programming?

Why data science workflow requires programming?

It is possible to do lot of data work using Excel, Tableau, or any other Business Intelligence tools that have graphical interfaces. These BI tools are known for taking any kind of data from almost...
What is Data Mining, and how is it used in Business?

What is Data Mining, and how is it used in Business?

Data mining is used to search for valuable information from vast amount of data collected over time. The information may be certain patterns or relationships that exists within a data. Businesses use...
What are different types of Data Science Roles?

What are different types of Data Science Roles?

Data science is the practice of using data to try to understand and solve real-world problems. If you’ve looked into the different areas of data science, you may be familiar with Drew Conway’s popular...
What is Data Science?

What is Data Science?

Without data, you’re just another person with an opinion – W. Edwards Deming, noted Statistician, Professor, Author, and Lecturer “The sexiest job of the 21st century.” Data scientist, a title...

Follow us on Social Media

Scroll to Top