Chevron Left
返回到 Getting and Cleaning Data

学生对 Johns Hopkins University 提供的 Getting and Cleaning Data 的评价和反馈

4.5
8,070 个评分

课程概述

Before you can work with data you have to get some. This course will cover the basic ways that data can be obtained. The course will cover obtaining data from the web, from APIs, from databases and from colleagues in various formats. It will also cover the basics of data cleaning and how to make data “tidy”. Tidy data dramatically speed downstream data analysis tasks. The course will also cover the components of a complete data set including raw data, processing instructions, codebooks, and processed data. The course will cover the basics needed for collecting, cleaning, and sharing data....

热门审阅

HS

May 2, 2020

This course provides an introduction of some important concepts and tools on a very important aspect of data science: cleaning and organizing data before any analysis. A must for any data scientist.

BD

Oct 25, 2016

This course is really a challenging and compulsory for any one who wants to be a data scientist or working in any sort of data. It teaches you how to make very palatable data-set fro ma messy data.

筛选依据:

1126 - Getting and Cleaning Data 的 1150 个评论(共 1,313 个)

创建者 Mike

Oct 22, 2018

I think some concepts could have been taught better with simple examples first, and then gradually move to more complex ones, but using noisy data blur the learning objective , and again... the instructors are just showing up a slide.. I think the power of video and illustrations could have been better utilized

创建者 Cameron L

Apr 22, 2022

The course introduces many good packages and skillsets, but doesn't really instruct on their use. The work typically requires extra outside learning to complete. The R courses in the data track have typically been much better than this specific instance, but it is required to complete that track.

创建者 Brett C

Aug 12, 2022

Not bad, but as a number of other reviewers have said, the course material is out of date. There are a couple of places where the packages discussed are no longer available, the functions taught are deprecated, or where there's additional installation steps required for the newest versions.

创建者 Allyson D d L

Nov 4, 2021

The course is good to learn more R commands but only in the last week there is a practical assignment. I think if all weeks could have practical assignments this course would be excellent. In this assignment we don't use all the commands that we learnt. So, this course has a lot to improve.

创建者 Debjit C

Jul 6, 2020

I had a very interesting experience in the course. Thanks to all the help from the discussion forum and data science communities such as StackOverflow . They have the best resources to learn.

The assignments were a bit difficult to understand but once understood, it was quiet easy to solve.

创建者 Bruno V

Jun 22, 2020

The course is good but should update its links and go deep into the regex syntax. Moreover, the tasks of the assignment was not difficult. However, it was not easy to understand the tasks as they were not well explained/written. Overall the course is good and I recommend it.

创建者 Kelechi M A

Feb 13, 2022

There is a huge gap between what is touched on in the lectures and the project. The upside is that it shines light on what the student should do further research and study on. The downside is it almost becomes unwise and a waste of time to continue with Coursera.

创建者 Ryan B

Apr 20, 2020

Learned some very useful skills, but I found that some of the weeks moved too quickly without sufficiently explaining the background information required (as someone without a data science background) with abstract concepts that were not grounded in application.

创建者 FARROUK_ABDERRAHIM B

Oct 12, 2020

the assignment project was hard and really not enough instruction was given and it was a machine learning data set which made it very hard :) i mean we hadnt seen anythin similar to that during courses :) fix that and change project assignment for final week

创建者 Mark P

Jul 12, 2021

The course give very broad overviews in the lectures, then drops very difficult questions in the quiz and assiagnments. It is good to push a little and make you dig for solutions on the internet, but the jump in difficulty is too far to make it worthwhile.

创建者 Carlos M C D

Feb 8, 2016

The course is good, but it doesn't really offer all the tools required to pass the exams. I had to take extra courses in other place in order to pass. In addition, the exams some times become a bit too subjective of what the classmates want to grade you.

创建者 Cian O

Jul 9, 2024

Gave good lectures on the methods used to get and clean data. The main downside was that the data and methods used was often outdated, and newer methods now exist. Still gives a good understanding of what you should aim for, and how to achieve it.

创建者 Bangda S

Nov 9, 2016

This course provides a lot of methods and strategies about reading data, manipulating data. But I think some important issues in the real world are not discussed enough here, like how to treat missing values, how to deal with messy format data.

创建者 Efe Y

Jan 20, 2021

Had a lot of trouble accessing and downloading datasets from the internet despite I were using the same source codes. Beside teaching how to download data from internet, it would be great if datasets were also included in the course content.

创建者 Dominic H

May 27, 2018

You will learn valuable tools, techniques and concepts but be prepared to feel overwhelmed (if you have no computer science background whatsoever) by quizzes and the assignment which require you to do research stuff outside of this course.

创建者 sunsik k

Jul 18, 2017

Quite disappointed at 'Getting data' part because of lack of explanation(I only had to learn extra sources to understand) but satisfied with 'Cleaning data' part. It would have been more useful if course described how to use GitHub, at the

创建者 Fabiana G

Jun 23, 2016

The content of the course is good, but it seems abandoned - some links are outdated or don't work. I think it would be a much better experience for students if these first courses in the specialization got more love from the instructors.

创建者 Steve W

Feb 3, 2016

The lecture material was high level, and didn't seem to be a good preparation for the quizzes.

The description for the final project was not very detailed, and the grading rubric likewise was not very specific for peer review.

创建者 Andrew G

Jul 28, 2018

I thought the course project grading was supposed to focus on what we learned in class, not almost entirely on creating readme and codebook files. Also, the explanation of what was expected for the project was NOT CLEAR,

创建者 Wentao B

Apr 2, 2016

The content of the course is too general, with too brief introduction of some commands in the lecture notes(slides), I don't think it would be very helpful for the students to deal with some real complicated problems.

创建者 Justin z

Apr 12, 2017

brought up some good concept inside, like "tidy data", but not in detail, how to grab data from different source shouldn't be difficult. should have more focus on talking about data.table, "tidy data" principles etc.

创建者 Bekhzod A

Mar 13, 2016

Hi all,

Course provides interesting insight to getting and cleaning data. However, the course misses practical examples (not only showing the code in the slides, but also presenting how it works in R or RStudio).

创建者 Ehab H A

Feb 4, 2019

This course was too hard for me compared to the first two in the program. Not sure whether it is because of my limited background in the subject area, or because of the abrupt shift in level from course 2 to 3.

创建者 Sven B

Apr 30, 2016

This course is of lower quality than the preceding courses. The final assignment instructions are not clear. The forums helped but I have the impression that they are not really followed by the mentors.

创建者 Calvin l

Oct 4, 2023

Sometimes difficult to follow, and the content isn't really relevant to what I want to learn, considering I am not particularly interested in professional data science. Took about 2 hours to finish.