Error occured ! We are notified and will try and resolve this as soon as possible.
WARNING! [2] file_put_contents(/home/myedu/domains/myeducationpath.com/app/../html/cache/memory/course_22750_0_e086762d743c0218beb85ea6e1b456cae.txt): Failed to open stream: No such file or directory . Line 75 in file /home/myedu/domains/myeducationpath.com/html/include/class.cache.php. Continue execution. 1673017; index.php; 3.133.131.168; GET; url=courses/22750/distributed-machine-learning-with-apache-spark.htm&; ; Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; ClaudeBot/1.0; +claudebot@anthropic.com); ; Executon time: 0 MyEducationPath.com :: edX : Distributed Machine Learning with Apache Spark

Distributed Machine Learning with Apache Spark

0 votes
Free Closed [?]
Distributed Machine Learning with Apache Spark

Machine learning aims to extract knowledge from data, relying on fundamental concepts in computer science, statistics, probability and optimization. Learning algorithms enable a wide range of applications, from everyday tasks such as product recommendations and spam filtering to bleeding edge applications like self-driving cars and personalized medicine. In the age of ‘big data’, with datasets rapidly growing in size and complexity and cloud computing becoming more pervasive, machine learning techniques are fast becoming a core component of large-scale data processing pipelines.



This statistics and data analysis course introduces the underlying statistical and algorithmic principles required to develop scalable real-world machine learning pipelines. We present an integrated view of data processing by highlighting the various components of these pipelines, including exploratory data analysis, feature extraction, supervised learning, and model evaluation. You will gain hands-on experience applying these principles using Spark, a cluster computing system well-suited for large-scale machine learning tasks, and its packages spark.ml and spark.mllib. You will implement distributed algorithms for fundamental statistical models (linear regression, logistic regression, principal component analysis) while tackling key problems from domains such as online advertising and cognitive neuroscience.


Categories:
Starts : 2016-07-11

Comments

Alternatives

-- no alternatives found for the course --
If you know any alternatives, please let us know.

Prerequisites

-- no prerequsites found for the course --
If you can suggest any prerequisite, please let us know.

Paths

No Paths inclusing the course. You can build and share a path with this course included.

Certification Exams

-- there are no exams to get certification after this course --
If your company does certification for those who completed this course then register your company as certification vendor and add your exams to the Exams Directory.

Let us know when you did the course Distributed Machine Learning with Apache Spark.

Started on: Completed on:
Your grade (if any):
Comments:

Add the course Distributed Machine Learning with Apache Spark to My Personal Education Path.

Start the course on:
Duration of study:
Notes:

Successfully added to your path.

View your path

Select what exam to connect to the course. The course will be displayed on the exam page in the list of courses supported for certification with the exam.


Notes about how the exam certifies students of the course (optional):