
Social Research Insights





R (Language) Training

R is the lingua franca of statistics: open source software released under the GNU GPL. R is not only statistical computing software but also a research management system. Because it is a full programming language, its computing capabilities are open-ended and highly efficient compared with its counterparts. SAS is 11 years behind R, and SPSS is in no way comparable.

Social Research Insights strongly promotes open source tools for quantitative analysis. We live in an information age in which software is no longer a private good. Open source tools are developed by a community rather than a company and are available for free. India ranks second in the world in the use of pirated software. The value of piracy stood at $35 million in 2014. Companies that use pirated software save very little, an amount IDC estimates at $0.84 million. On top of that, using pirated software goes against personal and professional ethics. In contrast, using open source software is a virtue and a matter of uprightness.

Research has undergone dramatic change. Software tools are now widespread in both academic and corporate research. There is nothing wrong with using commercial software, but pirating it is unethical. Data analysis software is the target of much of the piracy in the business analytics industry. Many workshops and academics recommend commercial software yet use pirated copies. This is a serious flaw in the industry, and one that Social Research Insights is trying to fix. That is why Social Research Insights recommends open source tools for data analysis.

In fact, the bursting of the data analytics bubble in 2013 was due to the open source and free software revolution. Open source projects such as R, Hadoop, WEKA, Julia, and Python changed the way both academia and industry use software.

R is absolutely free and is maintained by the R Foundation. For more information, visit the project website, where you can also download R from the download section. The beauty of R is that not only is R itself free, but so are all its packages (more than 6000 as of 2015). These packages cover almost everything that exists in the world of quantitative analysis. Please visit the CRAN Task Views for more details on who is using R. R is used in all domains of knowledge, including chemistry, physics, mathematics, bioinformatics, geoinformatics, production and quality control, Six Sigma, and operational research. In short, it is suitable wherever numbers are involved.


PROGRAM - 1: R for statistical analysis

(10 hours; two Sundays; 5 hours each day)

Grow into a data scientist in 10 hours. The program covers the following topics.

  1. Introduction to data science.
  2. Statistics for decision making.
  3. Introduction to Big data analytics.
  4. Descriptive statistics (Univariate): summary statistics like measures of central tendency, measures of dispersion, measures of shape. Cross tabs.
  5. Inferential statistics (Univariate & Bivariate): Statistical tests (chi-square test, T test, Z test, F test, other parametric and non-parametric tests)
  6. Exploratory statistics (Bivariate & Multivariate): Correspondence analysis (including MCA, MJCA), Principal components analysis, Factor analysis, Cluster analysis.
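
As a taste of the topics above, here is a minimal sketch in base R. It assumes the built-in `mtcars` dataset as stand-in data; the dataset and the variables chosen are our illustration only and are not the course material itself.

```r
# Illustrative sketch: base R only, using the built-in mtcars dataset.
data(mtcars)

# Descriptive statistics (univariate): central tendency and dispersion.
mpg_mean   <- mean(mtcars$mpg)
mpg_median <- median(mtcars$mpg)
mpg_sd     <- sd(mtcars$mpg)
print(summary(mtcars$mpg))

# Cross tab: number of cylinders by transmission type (0 = automatic, 1 = manual).
cyl_by_am <- table(cylinders = mtcars$cyl, transmission = mtcars$am)
print(cyl_by_am)

# Inferential statistics: Welch two-sample t test of mpg by transmission type.
tt <- t.test(mpg ~ am, data = mtcars)
print(tt)

# Exploratory statistics: principal components analysis on four scaled variables.
pca <- prcomp(mtcars[, c("mpg", "disp", "hp", "wt")], scale. = TRUE)
print(summary(pca))
```

Each call returns an ordinary R object (`tt`, `pca`, and so on), so results can be inspected, stored, or passed to further analysis.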

PROGRAM - 2: R for Advanced Visualization

(5 hours; two Sundays; 5 hours each day)

  1. Creating statistical graphs - 2D: bar charts, histograms, pie charts, box plots, line charts etc.
  2. Scatter diagrams, linear plots, Q-Q plots, regression plots
  3. 3D plots
  4. Animations
  5. Creating flow charts and block diagrams.
  6. Image manipulation and editing.
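
A few of the 2D plots listed above can be sketched with base R graphics alone. This is an illustrative example that again assumes the built-in `mtcars` dataset; the `pdf(NULL)` line is our addition so that the script runs without opening a window or writing a file, and it can be dropped in an interactive session.

```r
# Illustrative sketch: base R graphics with the built-in mtcars dataset.
data(mtcars)

pdf(NULL)  # null device: nothing is written to disk; omit when working interactively

# Histogram and box plot.
h <- hist(mtcars$mpg, main = "Miles per gallon", xlab = "mpg")
b <- boxplot(mpg ~ cyl, data = mtcars, xlab = "Cylinders", ylab = "mpg")

# Scatter diagram with a fitted regression line.
plot(mtcars$wt, mtcars$mpg, xlab = "Weight (1000 lbs)", ylab = "mpg")
fit <- lm(mpg ~ wt, data = mtcars)
abline(fit)

# Q-Q plot of the regression residuals.
qqnorm(resid(fit))
qqline(resid(fit))

dev.off()
```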

PROGRAM - 3: Advanced analytics through R language.

(10 hours; two Sundays; 5 hours each day)

  1. (Social) Network analysis.
  2. Artificial neural networks.
  3. Text analytics (Text mining).
  4. Data mining and Web scraping.

PROGRAM - 4: R for bioinformatics.

(10 hours; two Sundays; 5 hours each day)

  1. Data display & descriptive statistics
  2. Probability distributions
  3. Estimation & Inference
  4. Linear Models
  5. Cluster analysis & Trees
  6. Classification methods
  7. DNA sequencing
  8. Microarray analysis

PROGRAM - 5: R for geoinformatics (GIS).

(10 hours; two Sundays; 5 hours each day)

  1. Analysis of (geo) spatial data.
  2. Working with coordinates (longitudes & latitudes)
  3. Plotting geographic maps (terrain, satellite, road, hybrid, etc.)
    1. Downloading map raster
    2. Plotting data on map raster
  4. Plotting 3D maps & Digital embossing.

PROGRAM - 6: R for Quality Control & Six Sigma.

(10 hours; two Sundays; 5 hours each day)

  1. Quality control
    1. qcc (R package): Shewhart quality control charts for continuous, attribute and count data. Cusum and EWMA charts. Operating characteristic curves. Process capability analysis. Pareto chart and cause-and-effect chart. Multivariate control charts.
    2. IQCC (R package): Statistical control charts with exact limits for univariate and multivariate cases.
    3. qualityTools (R package): DMAIC, normal and non-normal process capability indices, Gage Repeatability, factorial and fractional factorial designs, Pareto chart; Measure - probability and quantile-quantile plots, process capability indices etc.
  2. Six Sigma
    1. SixSigma (R package): Functions and utilities to perform statistical analyses the Six Sigma way, through the DMAIC cycle (Define, Measure, Analyze, Improve, Control): Gage R&R, capability analysis, control charts, loss function analysis, etc.

PROGRAM - 7: R for Medical Image Analysis.

(15 hours; available only as a workshop; offered at the client's premises)

  1. Magnetic resonance imaging (MRI): diffusion tensor imaging (DTI), dynamic contrast-enhanced MRI (DCE-MRI), functional connectivity, functional MRI, structural MRI, visualization, simulation of time series and 4D data.
  2. General image processing.
  3. Positron emission tomography (PET)
  4. Digital Imaging and Communications in Medicine (DICOM)
  5. Electroencephalography (EEG).

PROGRAM - 8: Big data analytics through R language.

Duration: 5 hours

  1. Explicit parallelism
  2. Implicit parallelism
  3. Grid computing
  4. Hadoop
  5. Random numbers
  6. GPU computing
  7. Resource (memory) management
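
To give a flavour of explicit parallelism and parallel random numbers, here is a minimal sketch using the `parallel` package that ships with R. The task function `simulate_mean`, the two-worker cluster, and the seed are hypothetical choices made for illustration only.

```r
# Illustrative sketch: explicit parallelism with the 'parallel' package (part of base R).
library(parallel)

# Hypothetical task: the mean of 1000 standard normal draws.
simulate_mean <- function(i) mean(rnorm(1000))

# Start two local worker processes (assumed count; adjust to the machine).
cl <- makeCluster(2)

# Give each worker its own reproducible random number stream.
clusterSetRNGStream(cl, iseed = 2015)

# Run the task 8 times, spread across the workers.
results <- parLapply(cl, 1:8, simulate_mean)

# Always release the workers when done.
stopCluster(cl)

estimates <- unlist(results)
print(estimates)
```

Setting the streams with `clusterSetRNGStream` keeps the workers from drawing overlapping random sequences, which matters for simulation work.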

PROGRAM - 9: R for Programmers (To be updated)



All the above programs are available both online and offline. Write to or for more details. Corporate clients can write to or

The programs are customizable. We can adapt the contents to the needs of participants, and trainees can choose contents according to their individual (research) needs. Please feel free to write to the email addresses above, mentioning the changes you need.


Social Research Insights is the first academy to introduce the R language in academics in India.