PDF Version Available
This document is also available in PDF format: week1.pdf
The PDF version includes bookmarks for easy navigation and is optimized for printing.
Accessibility Notice
This document is also available in HTML format at:
https://aholdengouveia.name/AdvData/labs/week1.html
The HTML version provides enhanced accessibility features including keyboard navigation, screen reader support, responsive design, dark mode support, and high contrast options.
Objectives:
- Learn about the difference between dataset, and how to evaluate a dataset for quality
Complete the following problems
References, a video, a PowerPoint and some notes are available at my website https://www.aholdengouveia.name/AdvData/IntroData.html
- Pick 3 datasets, you may use your own resources or pull from the listed resources. Make sure to list 2 pros and 2 cons, minimum for each dataset. Resources - https://www.aholdengouveia.name/AdvData/Resources.html
- Watch one of listed the TED talks and give it a 1 page review itemize
- https://www.ted.com/talks/madhumita_murgia_how_data_brokers_sell_your_identity/transcript?language=en
- https://www.ted.com/talks/sebastian_wernicke_how_to_use_data_to_make_a_hit_tv_show/transcript?language=en
- https://www.ted.com/talks/finn_lutzow_holm_myrstad_how_tech_companies_deceive_you_into_giving_up_your_data_and_privacy/transcript?language=en
- https://www.ted.com/talks/hans_rosling_let_my_dataset_change_your_mindset/transcript?language=en
- https://www.ted.com/talks/mainak_mazumdar_how_bad_data_keeps_us_from_good_ai/nscript
- https://ideas.ted.com/how-open-government-data-creates-smarter-societies/
Deliverables
- Text document of your TED talk review
- Document for your datasets, include a pro/con list for each one. Don't forget to include a link to your dataset.