University of California San Diego

Introduction to Big Data

Interested in increasing your knowledge of the Big Data landscape? This course is for those new to data science and interested in understanding why the Big Data Era has come to be. It is for those who want to become conversant with the terminology and the core concepts behind big data problems, applications, and systems. It is for those who want to start thinking about how Big Data might be useful in their business or career. It provides an introduction to one of the most common frameworks, Hadoop, that has made big data analysis easier and more accessible -- increasing the potential for data to transform our world!

At the end of this course, you will be able to:

* Describe the Big Data landscape, including examples of real-world big data problems and the three key sources of Big Data: people, organizations, and sensors.
* Explain the V’s of Big Data (volume, velocity, variety, veracity, valence, and value) and why each impacts data collection, monitoring, storage, analysis, and reporting.
* Get value out of Big Data by using a 5-step process to structure your analysis.
* Identify what are and what are not big data problems, and be able to recast big data problems as data science questions.
* Provide an explanation of the architectural components and programming models used for scalable big data analysis.
* Summarize the features and value of core Hadoop stack components, including the YARN resource and job management system, the HDFS file system, and the MapReduce programming model.
* Install and run a program using Hadoop!

This course is for those new to data science. No prior programming experience is needed, although the ability to install applications and use a virtual machine is necessary to complete the hands-on assignments.

Hardware Requirements: (A) Quad Core Processor (VT-x or AMD-V support recommended), 64-bit; (B) 8 GB RAM; (C) 20 GB disk free.

How to find your hardware information: (Windows): Open System by clicking the Start button, right-clicking Computer, and then clicking Properties; (Mac): Open Overview by clicking on the Apple menu and clicking “About This Mac.” Most computers with 8 GB RAM purchased in the last 3 years will meet the minimum requirements. You will need a high-speed internet connection because you will be downloading files up to 4 GB in size.

Software Requirements: This course relies on several open-source software tools, including Apache Hadoop. All required software can be downloaded and installed free of charge. Software requirements include: Windows 7+, Mac OS X 10.10+, Ubuntu 14.04+, or CentOS 6+; VirtualBox 5+.

Status: Data Science
Status: Unstructured Data

Featured reviews

MR

5.0 · Reviewed on Dec 2, 2020

An excellent introduction to Hadoop and MapReduce, for those who know nothing about these things. The course uses a virtual machine, which means you don't need to install Hadoop on your own machine.

VJ

4.0 · Reviewed on Oct 26, 2020

The Hadoop commands were from an old version, although newer versions of the commands exist. However, the course content was very interactive and interesting, and it made learning easy.

KN

5.0 · Reviewed on Jun 1, 2020

This is a great course. I learnt a lot from it. What I like about this course is the hands-on experience with Hadoop. Such a good add-on to our skills, instead of just theoretical learning.

HM

5.0 · Reviewed on Sep 8, 2019

I love the course. It goes deep into the foundations, and then finishes up with an actual lab where you learn by practice. I greatly benefited from it and feel I have achieved a milestone in big data.

AR

5.0 · Reviewed on Mar 30, 2020

One of the best courses to start learning new cutting-edge technology and to get deeper insights into Big Data. Thanks to the great instructors for amazing explanations of each module and e-materials.

PB

5.0 · Reviewed on May 24, 2018

A step-by-step approach starting from basic big data concepts and extending to the Hadoop framework, hands-on mapping, and a simple MapReduce application development effort. Very smooth learning experience.

JT

5.0 · Reviewed on Aug 30, 2016

This is a great introduction for Big Data. It helps me to revisit what I learned from the meetups and webinars, then put the fundamental knowledge and information in a solid foundation. Thank you.

RG

5.0 · Reviewed on Jul 13, 2017

First of all, I would like to take this opportunity to thank the instructors. The course is well structured and explains the foundations through real-world problems, making the concepts easy to understand.

AJ

5.0 · Reviewed on Dec 21, 2020

A great and deep start on Big Data. This course helped me understand how useful big data is for solving real-world problems, either before they happen or once a problem occurs.

SS

5.0 · Reviewed on Sep 14, 2019

It is a comprehensive introduction to big data which covers significant components with enough content that can be absorbed at this stage. A very good kick-start, and I am excited for the next course ahead.

AK

5.0 · Reviewed on Aug 8, 2021

This course was really helpful for a basic understanding of Big Data, and the instructor's on-screen presence felt like a classroom session, which made the training more interesting.

VG

5.0 · Reviewed on Mar 25, 2018

Excellent opportunity to learn the concepts of Big Data and the Hadoop ecosystem. Overall a wonderful learning experience, with hands-on work to gain practical knowledge of the concepts learnt.

All reviews

Showing: 20 of 2,500

Deleted Account
5.0
Reviewed on May 10, 2022
Isara Anantavrasilp
3.0
Reviewed on Oct 4, 2018
Abdul Kittana
1.0
Reviewed on Sep 26, 2016
Rakesh Gopidi
5.0
Reviewed on Jul 14, 2017
Prabir Bhattacharyya
5.0
Reviewed on May 25, 2018
Patricia peñalosa
2.0
Reviewed on Nov 7, 2018
Catherine Boothman
5.0
Reviewed on Nov 8, 2018
Raivis Joksts
3.0
Reviewed on Feb 5, 2019
hatem murad
5.0
Reviewed on Sep 9, 2019
Hendrik Bruns
3.0
Reviewed on Dec 1, 2017
Alexandra Hazard Kampmann
1.0
Reviewed on Apr 9, 2017
T Bizreh
4.0
Reviewed on Feb 23, 2020
Guy Dupenloup
1.0
Reviewed on Feb 6, 2021
Sean Green
1.0
Reviewed on Sep 24, 2016
Pranav Vyas
1.0
Reviewed on Mar 7, 2019
Mayank Raj
1.0
Reviewed on Oct 2, 2019
Rongon Chatterjee
4.0
Reviewed on May 13, 2019
Ahmed Khalifa Ahmed Alhammadi
5.0
Reviewed on Jun 9, 2019
Azar Rzayev
5.0
Reviewed on Mar 31, 2020
Brian Song
4.0
Reviewed on May 8, 2022