Budete přesměrováni na web Pumpedu
Přehled kurzu
O kurzu
Školení dodává autorizovaný distributor DNS a.s.
The main focus areas cover Hadoop (including Pig, Hive, and HBase), Natural Language Processing, Social Network Analysis, Simulation, Random Forests, Multinomial Logistic Regression, and Data Visualization.
Taking an "Open" or technology-neutral approach, this course utilizes several open-source tools to address big data challenges.
Co se naučíte
Požadavky
- Completion of the Data Science and Big Data Analytics course
- Proficiency in at least one programming language such as Java or Python
Osnova kurzu
Lesson 1: The MapReduce Framework
- Lesson 2: Apache Hadoop
- Lesson 3: Hadoop Distributed File System
- Lesson 4: YARN
- Lesson 1: Hadoop Ecosystem
- Lesson 2: Pig
- Lesson 3: Hive
- Lesson 4: NoSQL - Not Only SQL
- Lesson 5: HBase
- Lesson 6: Spark
- Lesson 1: Introduction to NLP
- Lesson 2: Text Preprocessing
- Lesson 3: TFIDF
- Lesson 4: Beyond Bag of Words
- Lesson 5: Language Modeling
- Lesson 6: POS Tagging and HMM
- Lesson 7: Sentiment Analysis and Topic Modeling
- Lesson 1: Introduction to SNA and Graph Theory
- Lesson 2: Most Important Nodes
- Lesson 3: Communities and Small World
- Lesson 4: Network Problems and SNA Tools
- Lesson 1: Simulation
- Lesson 2: Random Forests
- Lesson 3: Multinomial Logistic Regression
- Lesson 1: Perception and Visualization
- Lesson 2: Visualization of Multivariate Data Module
Certifikace
This training prepares the learner for Dell Technologies Proven Professional advanced analytics specialist-level certification exam (E20-065).
Pro koho je kurz vhodný?
- This course is intended for aspiring Data Scientists, data analysts that have completed the associate level Data Science and Big Data Analytics course, and computer scientists wanting to learn MapReduce and methods for analyzing unstructured data such as text.