Powerful Proc Tabulate Explained

In this blog, we will use PROC TABULATE , one of the most powerful PROCS for data summarization using SAS. For creating summary table of the information (similar to Pivot in Excel), we need to define Classification Variables, also called dimensions Measurement Variables, also called Facts Structure of the Output Summary Tables PROC FREQ, PROC ... Read morePowerful Proc Tabulate Explained

Concatenating Datasets in SAS

Author: Mrinmoy Saikia Data preparation is one of the most significant steps in Data Science, Analytics or Reporting Projects. In this blog, we focus on learning - "Combining 2 or more SAS datasets Vertically". Combining SAS datasets vertically is also referred as Concatenating Datasets. Concatenating is combining 2 or more datasets one below another. From ... Read moreConcatenating Datasets in SAS

SAS Control Statements

In this blog, we will discuss some of the SAS control statements with examples. IF/WHERE Statement IF or WHERE statement is applied to select observations. WHERE condition is applied while reading observations from Input dataset where as IF condition is applied at Program Data Vector (PDV). Using the Where statement may improve the efficiency of ... Read moreSAS Control Statements

Chi Square Test using SAS

A chi-square test is an statistical method to test association between two categorical variables (especially between nominal variables).  Type of Variables. Correlation Analysis: When both the variables are continuous, and it can be done using Pearson Correlation Coefficient.  Correlation Analysis. ANOVA: One variable is categorical and other variable is continuous. Finding how levels of categorical variable ... Read moreChi Square Test using SAS

ANOVA using SAS and Example

Analysis of Variance (ANOVA) is used for comparing means across multiple samples. Focus here is only 1-Way ANOVA and there are a few different ways of applying similar concepts to different scenarios. If number of samples or groups is one or two, we can use T Test (T Test using SAS). Using one categorical variables, ... Read moreANOVA using SAS and Example

Retain Statement - Explained with Examples

Author: Rameshwari, done SAS for Data Analytics Training course from DnI Institute SAS programs are made of statements and each statements end with semi comma -";" . One of the important statements in SAS is RETAIN statement. Why do we need Retain Statement? Important point about SAS program is that in majority of  cases SAS ... Read moreRetain Statement - Explained with Examples

10 Most Commonly Used Character Functions in SAS

In this blog, we will discuss 10 most common text manipulation functions in SAS. These functions can be used for preparing data for text analytics or predictive model development. Length of String: LENGTH, LENGTHC and LENGTHN Change Case of String: LOWCASE, UPCASE and PROPCASE String Alignment: LEFT and RIGHT Remove or Trim Leading and Trailing Blanks: TRIM ... Read more10 Most Commonly Used Character Functions in SAS

Analysing Count and Proportions - using PROC FREQ

Variables from Analysis perspective are categorical and continuous (details on Variables Types). For summarising categorical variables, counts and proportions are used.  SAS has PROC FREQ procedures to summarise categorical variables. FREQ - read as frequency of variable values. In this blog, we will explore some of the commonly used options and statements of PROC FREQ. ... Read moreAnalysing Count and Proportions - using PROC FREQ

Proc Sort - Options and Scenarios

In a number of SAS PROCs or Procedures, we may want to group the observations together. For example if we are using BY statement in PROC PRINT (All About PROC PRINT), SAS prints observations grouped by BY variable(s) values. When we are using Base SAS and have specified BY statement, the input dataset have to ... Read moreProc Sort - Options and Scenarios

All About PROC PRINT

In this blog, we will discuss some of the commonly used options statements of PROC PRINT in SAS. Below are some of the common tasks which needs to be done and  how these can be achieved using PROC PRINT. Print a SAS dataset Print only a few variables of a SAS dataset (VAR statement) Print ... Read moreAll About PROC PRINT

10 Data Manipulation Scenarios and SAS Codes

Scenarios 1-Creating multiple rows from a single row in the input SAS dataset: You have a pharma shop and each of the medicines has manufacturing date and expiry date for each batch of a medicine. From the input SAS dataset with one observation for a medicine to multiple observations for each of the valid date ... Read more10 Data Manipulation Scenarios and SAS Codes

Predictive Modelling Technique - Logistic Regression - Interpret Output - Part 2

MAXIMUM LIKELIHOOD AND ODDS RATIO Analysis of Maximum Likelihood Estimates Parameters in logistic regression are estimated using Maximum Likelihood Estimation (MLE) functions.  The significance of individual exploratory variable parameters is assessed using Wald Chi Square test. Parameter:  Intercept and exploratory variables used in a logistic model, the weight of these are estimated using MLE DF: ... Read morePredictive Modelling Technique - Logistic Regression - Interpret Output - Part 2

Predictive Modelling Technique - Logistic Regression - Interpret Output

Originally published on RamG Data Analytics & Insights (www.ramganalytics.com) In the previous blog, we  elaborated on Why and How to learn Predictive Modelling? One of the commonly used statistical techniques is Logistic Regression.  In this blog focus is to understand logistic regression out. We are using SAS for executing logistics regression but similar results & ... Read morePredictive Modelling Technique - Logistic Regression - Interpret Output

Interview Process - Evaluating Analytical Skills

In the previous blog, we have shared the list of questions which were asked for evaluating communication, confidence, and technical skills (SAS). In the next round, main expectation was to check the candidate for analytics skills. After the interviewers were comfortable with the technical skills (e.g. SAS in this case), in this round questions were asked ... Read moreInterview Process - Evaluating Analytical Skills

Analytics Interview Process and Questions: Real Case

Typically an analytics job interview process involves multiple stages or rounds. In each of these stages, various skills are evaluated. Some of the key skills required for Analytics Professional or Data Scientist roles are Communication & Leadership Skills Functional and Business Domain Knowledge Logical and Analytical Skills Technical Skills - Machine Learning & Statistical Techniques ... Read moreAnalytics Interview Process and Questions: Real Case

SAS PROC SORT - New Options

Author: Debosmita Das   SAS PROC SORT is one of the commonly used procedure. Basic functionality of the procedure are discussed here. In this previous blog are How to sort a SAS dataset? - Use PROC SORT How to sort in Descending order? - using descending option before a variable name in BY statement How ... Read moreSAS PROC SORT - New Options

SAS Array and Do Loop – Creating derived variables for Predictive Model Building

In Predictive model development, a number of derived variables are created based on base variable. For example, based on monthly spend, average for combination of the months (e.g. average for last 2, 3 months etc), ratio variables (ration of last month to last 3 months). ARRAY and DO LOOP are quite helpful in creating the ... Read moreSAS Array and Do Loop – Creating derived variables for Predictive Model Building

Key Principles of Data Analytics using SAS

Based on our decades of SAS experience, we want to share with you some of the key principles. As there are exceptions to any rule, you may find for these as well. Incorrect result can erode credibility and trust built over many analyses and years. Hence validation of data analysis results is critical. One of ... Read moreKey Principles of Data Analytics using SAS

Useful SAS functions for Zip Code and Geo Distance

 Author: Prateek Paatni Plotting information on the map is one of the powerful visualization. Plotting on a map require latitude and longitude information of a place. For country level longitude and latitude information, one can refer a compiled country level information on the CountryISOCode,latitude and longitude One can refer Country level Heatmap which is plotted ... Read moreUseful SAS functions for Zip Code and Geo Distance