Analyzing Big Data with Microsoft R Server

0 votes

Free Closed [?]

This course is part of the Microsoft Professional Program Certificate in Big Data, and the Microsoft Professional Program Certificate in Data Science.

The open-source programming language R has for a long time been popular (particularly in academia) for data processing and statistical analysis. Among R's strengths are that it's a succinct programming language and has an extensive repository of third party libraries for performing all kinds of analyses. Together, these two features make it possible for a data scientist to very quickly go from raw data to summaries, charts, and even full-blown reports. However, one deficiency with R is that traditionally it uses a lot of memory, both because it needs to load a copy of the data in its entirety as a data.frame object, and also because processing the data often involves making further copies (sometimes referred to as copy-on-modify). This is one of the reasons R has been more reluctantly received by industry compared to academia.

The main component of Microsoft R Server (MRS) is the RevoScaleR package, which is an R library that offers a set of functionalities for processing large datasets without having to load them all at once in the memory. RevoScaleR offers a rich set of distributed statistical and machine learning algorithms, which get added to over time. Finally, RevoScaleR also offers a mechanism by which we can take code that we developed on our laptop and deploy it on a remote server such as SQL Server or Spark (where the infrastructure is very different under the hood), with minimal effort.

In this course, we will show you how to use MRS to run an analysis on a large dataset and provide some examples of how to deploy it on a Spark cluster or a SQL Server database. Upon completion, you will know how to use R for big-data problems.

Since RevoScaleR is an R package, we assume that the course participants are familiar with R. A solid understanding of R data structures (vectors, matrices, lists, data frames, environments) is required. For example, students should be able to confidently tell the difference between a list and a data frame, or what each object is generally a good representation for and how to subset it. Students should be familiar with basic programming concepts such as control flows, loops, functions and scope. Students should have a good understanding of how to write and debug R functions. Finally, students are expected to have a good understanding of data manipulation and data processing in R (e.g. functions such as merge, transform, subset, cbind, rbind, lapply, apply). Familiarity with 3rd party packages such as dplyr is also helpful.

Tags: BusinessNutritionOnline sap training

Details: https://www.edx.org/course/a...r-server-microsoft-dat213x-1

Language: English

Starts : 2017-07-01

edX

Comments

Reviews

-- no reviews yet -- make the first review

Add review [?] Discuss on the Forum

Alternatives
All alternatives to all edX courses

-- no alternatives found for the course --
If you know any alternatives, please let us know.

Add alternative course [?]

Prerequisites

-- no prerequsites found for the course --
If you can suggest any prerequisite, please let us know.

Add prerequisite [?]

Paths

No Paths inclusing the course. You can build and share a path with this course included.

Certification Exams

-- there are no exams to get certification after this course --
If your company does certification for those who completed this course then register your company as certification vendor and add your exams to the Exams Directory.

Add My Exams

Similar courses

Courses related to the course subject

-- no similar courses found --

Let us know when you did the course Analyzing Big Data with Microsoft R Server.

Add the course Analyzing Big Data with Microsoft R Server to My Personal Education Path.

Successfully added to your path.

View your path

Select what exam to connect to the course. The course will be displayed on the exam page in the list of courses supported for certification with the exam.

Course name *:
Description *:
Course web page *:
Course provider name *:
Course provider web site *:
Course price:
* - required

Started on:		Completed on:
Your grade (if any):
Comments:


		Use the filter to find a course from courses directory to suggest it as alternative. Or click "Suggest a course not listed on this site" to add a courses not listed on this site.

Start the course on:
Duration of study:
Notes: