Category: R

Completely Randomized Experimental Design

Completely Randomized Experimental Design, when do you call an experimental design a randomized design? Experimental designs in which the treatments are allocated randomly to the experimental units come under the category of randomized designs....

How to Measure Contingency-Coefficient (Association Strength)

How to Measure Contingency-Coefficient, when the hypothesis of independence of attributes in a contingency table is rejected by performing a chi-square test, ensures the association between two attributes. Such kinds of situations are interested...

Power analysis in Statistics with R

Power analysis in Statistics, there is a probability of committing an error in making a decision about a hypothesis. Hence two types of errors can occur in hypothesis, Type I error and Type II...

Principal Component Analysis in R

Principal Component Analysis in R, PCA is used in exploratory data analysis and for making decisions in predictive models. PCA is commonly used for dimensionality reduction by using each data point onto only the...

Class Imbalance-Handling Imbalanced Data in R

Handling Imbalanced Data in R, Class Imbalance classification refers to a classification predictive modeling problem where the number of observations in the training dataset for each class is not balanced. In other words, the...

Best Algorithm For Stock Prediction

Best algorithm for stock prediction, Stock Prediction-Intraday is one of the trading norms of the stock market, buy shares at the opening time of the market and then sell the same at the closing...

Exploratory Data Analysis (EDA)

Exploratory Data Analysis is one of the critical processes of performing initial investigations on data analysis. Basic idea is to discover the patterns, anomalies, test hypotheses, and check the assumptions with the help of...

Random Forest Feature Selection

Random Forest feature selection, why we need feature selection? When we have too many features or variables in the datasets and we want to develop a prediction model like a neural network will take...

Linear Discriminant Analysis in R

Linear Discriminant Analysis in R, originally developed by R A Fisher in 1936 to classify subjects into one of the two clearly defined groups. It was later expanded to classify subjects into more than...

Customer Segmentation K Means Cluster

Customer segmentation is the process of separation of customers into groups based on common characteristics or patterns so companies can market their products to each group effectively and significantly. In business-to-consumer marketing, most of...