Data

Books

Websites

Podcasts

Videos

Articles

On ethics

On color

On interactivity

Tools

A variety of useful toolkits have been designed to help support information visualization applications. Some include support for the full visualization pipeline from data to interactive graphics, while others focus only on a subset, typically graphics and interaction.

Visualization Cheatsheets

Visualization Toolkits

Visualization Tools

Visualization Programming Environments

Network Analysis Tools

Color Tools

Data Literacy Resources

Slides

Day 1: https://docs.google.com/presentation/d/e/2PACX-1vSq7ytkZLPS-Nbs5JYbeKG8EuYotCxRgUewmbgWWWCqGIV-KUX4AXCa-_5bbdYNilRd46n6p3F9IbmT/pub?start=false&loop=false&delayms=3000

General / good reads

Data quality

Data formats / conversions

Regex

Geospatial data

Working with the CLI

The command line interface (CLI, often referred to as "shell" or "terminal") is a very powerful tool, and each operating system has one. Working with the CLI is easiest on Linux and Mac, and they are both similar since they are both based on Unix. I really recommend getting to know the 101 of working with the terminal, e.g. through this tutorial. Here are some more tips for working with data on the CLI. This Twitter account gives useful and sometimes funny tips on how to make the most of the terminal.

Data processing in R

Exercises


KEV


Kontext:

https://www.srf.ch/news/schweiz/abstimmungen/abstimmungen/energiestrategie/die-profiteure-der-energiewende


Datensatz (Ausschnitt):

https://docs.google.com/spreadsheets/d/1c6A356Eg9zDX3dkYGGvDJuzfpmSgHnp-Zf7ucSodYRA/edit?usp=sharing


Sind folgende Auswertungen / Applikationen mit dem Datensatz möglich?


  1. Durchschnittliche Vertragslänge (Inbetriebnahme bis Vertragsende)
  2. Gesamtstromproduktion nach Kanton
  3. Gesamtsumme der Geldbezüge für einerseits Herren, andererseits Frauen
  4. Durchschnittliche Länge der Strassennamen nach Kanton
  5. Karte mit Standorten der KEV-Bezüger (= ein Punkt-Symbol für jeden genauen Standort, ähnlich wie in diesem Artikel)


Exercise solutions day 1

  1. Manillio:
    1. Browse https://developer.spotify.com/console/get-search-item/ to get his ID: 7uxtLjuqkJ3cnjQQuW6Cul
    2. Browse https://developer.spotify.com/console/get-artist-top-tracks/, fill in values and get JSON data
    3. Copy and paste into https://json-csv.com/
    4. Download as Excel - do the math (36.60705 min)
    5. Take-Aways:
      1. Sometimes, data is accessible via an API
      2. The preferred data format of APIs is JSON
      3. JSON can be converted into CSV
      4. The preferred way of talking to an API is with code
  2. Wasserstation Tiefenbrunnen
    1. First approach: Scraping data
      1. Browse https://www.tecson-data.ch/zurich/tiefenbrunnen/index.php (as probably shown on Google)
      2. Select “windchill”, 2.11.2018/7.11.2018 and “all values” at the very bottom
      3. Copy stuff into Excel by hand and calculate median
    2. Second approach: Open Data Zürich / API:
      1. Browse https://tecdottir.herokuapp.com/docs/#/measurements
      2. Enter parameters
      3. Copy curl string and pipe into a file
      4. Upload JSON and paste into  https://json-csv.com/ (bonus: use matrix style)
      5. Download CSV, open in Excel and calculate median (don’t forget to filter unneeded dates)
    3. Take-Aways:
      1. Copying and pasting stuff from HTML tables should be avoided
      2. Always look out for an API
      3. Try out different settings of your tools - they might bring you better results (“matrix style”)
      4. Get to know the terminal
      5. Excel / LibreOffice / OpenOffice have some good filters: get to know how to use them
      6. If you run out of queries, delete cookies
  3. Schlichtungsverfahren
    1. Google it and go to https://www.bwo.admin.ch/bwo/de/home/mietrecht/schlichtungsbehoerden/statistik-der-schlichtungsverfahren.html
    2. Download first PDF
    3. Download Tabula and launch, upload PDF (or use Adobe Reader DC)
    4. Select last table, lattice extraction format
    5. Download as CSV
    6. Open in LibreOffice and make chart
    7. Take-Aways:
      1. Many interesting data are buried in PDFs
      2. Use proprietary software or Tabula to extract the data