Menu

Most Business Managers use Data Analytics and Business Analytics terms interchangeably. However, there is subtle difference between these terms and … What is the difference between Data...

Data manipulation is the process of changing or transforming data to make it more organized, readable, and useful. It’s used … Why should I use R: The Excel R Data Manipulation...

This is a third article on ongoing series on why you should use R. Here is a quick recap of … Why should I use R: The Excel R Data Visualization comparison Read More »...

This is a second article on ongoing series on why you should use R. In the last article we have … Why Should I use R: The Excel R Data Exploration comparison Read More »...

At present, no sector is untouched by the effects of data analytics. From aviation to manufacturing, retail to pharmaceuticals, gaming … R Vs Excel – What is the difference? Read More »...

This is the second article of two part article series on handling date and time data in R. You can … Handling Date and Time data in R – Part 2 Read More »...

A time series is a list of observations ordered successively over time. In a time series, observations are often recorded … Handling Date and Time data in R – Part 1 Read More »...

String manipulation refers to the process of modifying, analyzing, or transforming strings, into useful form for analysis. This is a … String Manipulation in R – turning text into useful...

Character vectors in R are used to store text data. R also provides a variety of built-in functions to deal … Mastering Text Output in R: From Basics to Advanced Techniques Read More »...

Heatmaps are used to show relationships between two variables, one plotted on each axis. By observing how cell colors change … Unlocking Insights: Visualizing Data with Heatmaps for Clearer...

A line chart, connects a series of data points using a line. Line chart also displays sequential values to help … A Beginner’s guide to Line charts in R Read More »...

In R, for loop is used to repeat evaluating an expression with an iterator on list or vector. In practice, … Get introduced to R’s apply Family: Your Guide to apply( ), lapply( ), sapply( )...

when starting to use R, most R users use loops whenever they need to iterate over elements of vector, list … Understanding Loop expressions in R Read More »...

Histograms and Density Plots show the distribution of the data. We can also show the distribution of a data using … Anatomy of a box-plot and how to create it in R Read More »...

Bar charts are used to show the comparison of numeric values of a categorical variable. For e.g. sale of different … Four distinct Bar Plot representations in R Read More »...

Frequency histograms are useful when you want to get an idea about the distribution of values in numeric variable. The … How to draw histogram in R Read More »...

Summarizing your data, either numerically or graphically, is an important component of any data analysis. Fortunately, R has excellent graphics … How to generate scatter plot in R? Read More »...

In business, perhaps the most widely used file format to store the data is the excel workbook. An excel workbook … How to import Excel file into R? Read More »...

The first step in any kind of data analysis in R is to load the data, that is, to import … How to import and export CSV file in R? Read More »...

When you wanted to import the text file in R, you must have used read.table() function. But do you know … Need HELP in R? A sneak peak into R’s Help ecosystem. Read More »...

When the new data is presented to you for the analysis, you would like to get first hand information on … Learn 6 easy functions from base R to spot check your data set Read More »...

When I speak to beginner data analysts, I could hear few miss-beliefs about data mining tools. I am writing this … 6 Fallacies of Data Mining Read More »...

In my last article we learned how to estimate population mean by using sample mean when population standard deviation is … What is t-distribution? – An intuitive understanding using...

What is point estimation and what is its fundamental drawback? The use of single sample value such as X_bar (sample … What is the difference between Point estimation and interval estimation?...

In this article we will have high-level overview of the process of data science. We will look at different stages … What are different stages of Data Science work? A high-level overview of the...

Many statistical tests assume that the data is normally distributed. Hence if the underlying data is not normal, we need … How to remove skewness in the data? Learn 6 powerful data...

The basic idea of inferential statistics is to use a statistic (mean,Standard Deviation etc.) calculated on a sample in order … An intuitive understanding of Sampling distribution and central...

In data analysis, we may obtain greater insight while expressing a variable in different form. For e.g. you could use … A brief introduction to Linear Transformation Read More »...

There are range of techniques that you can use to check if your data sample deviates from a Gaussian distribution, called normality tests. In this tutorial, we will learn some techniques that can be...

### A gentle introduction to Sampling terms and definitions – A pre-requisite to inferential statistics.

In this tutorial we will get comfortable with some of the commonly used terms from the field of Sampling theory The clarity on these terms is required to understand the inferential statistical...

In this tutorial, we will have gentle introduction to normal distribution with real world example. We will generate normal distribution plot in R and learn some R functions to calculate the...

In this tutorial we will look at Poisson distribution characteristics, build Poisson distribution formula and look at some R functions to calculate the probability of occurrence using Poisson...

In this tutorial, we will understand the assumptions of binomial distribution, take a business example of binomial distribution, build the binomial distribution formula and use R to solve the problem...

In this tutorial we will look at two measures of relationship between two numeric variables: the covariance and coefficient of correlation...

More commonly used dispersion measures in statistics are variance and standard deviation. These measures give summary statistics, hence does not tell much about the overall data. A five number summary...

Descriptive statistics is used to describe the data. A first step in this process is to check the distribution of values of each numeric variable. In this tutorial, various tools for describing the...

In this post you will discover why statistics is important in general and for data science in particular, and types of methods that are available...

R packages are extensions to the R programming language. R packages contain code, data, and documentation in a standardized collection … What are R Packages and how to install them? Read More »...

Machine learning, knowledge discovery from data and related areas experienced strong development in the 1990s. Both in academia and industry, the research on these topics was advancing quickly...

In R, a list is an ordered collection of objects, like vector, but lists can actually combine objects of different types. List elements can contain any type of object that exists in R...

Think back your first chemistry or biology lab course. As you entered into a lab, what was the first thing … What is reproducible data science project? Read More »...

The factor data type is used to represent character data. This character data, however takes a small number of distinct values. Each distinct value is represented by a integer code, which is called as...

A data frame represents a data with a number of rows and columns. Unlike matrix, data frames can contain variables with different data types, therefor Data Frames are heterogeneous...

In R, matrix is a vector with two additional attributes, the number of rows and the number of columns. Since vectors are the building blocks of matrices, like vectors, matrices are also constrained to...

The first step in learning R programming is getting familiar with basic R objects and their structure. The fundamental data … Unraveling R data types – The Vectors – A gentle...

This is the second post in the “Getting Started with R Programming” series. In the previous post, we discussed the … Getting started with R – Part 2: RStudio installation...

In this blog we will install R (for Windows and Mac OS) and have a quick tour of R environment...

R, as a programming language, has been evolving and developing over the last 20 years. Its goal is quite clear to make it easy and flexible to perform comprehensive statistical computing, data...

It is possible to do lot of data work using Excel, Tableau, or any other Business Intelligence tools that have … Why data science workflow requires programming? Read More »...

Data mining is used to search for valuable information from vast amount of data collected over time. The information may … What is Data Mining, and how is it used in Business? Read More »...

Data science is the practice of using data to try to understand and solve real-world problems. If you’ve looked into … What are different types of Data Science Roles? Read More »...

Without data, you’re just another person with an opinion – W. Edwards Deming, noted Statistician, Professor, Author, and Lecturer “The … What is Data Science? Read More »...