...
Slides
...
...
- Datacamp.com: Good, beginner-friendly and interactive courses on R (and Python). Very recommended - also check out my own course :-)
KEV
Kontext:
Datensatz (Ausschnitt):
https://docs.google.com/spreadsheets/d/1c6A356Eg9zDX3dkYGGvDJuzfpmSgHnp-Zf7ucSodYRA/edit?usp=sharing
Probiert folgende Auswertungen mit den Daten anzustellen?
Welche Schritte braucht es dafür?
Welche Abstriche muss man bei der Auswertung machen?
- Durchschnittliche Vertragslänge (von Inbetriebnahme bis Vertragsende)
- Gesamtstromproduktion nach Kanton
- Gesamtsumme der Geldbezüge für einerseits Herren, andererseits Frauen
- Gesamtsumme der jeweils ersten Zahl in der Strassennummer, nach Kanton und Energieträger.
Exercise solutions day 1
...
- Sometimes, data is accessible via an API
- The preferred data format of APIs is JSON
- JSON can be converted into CSV
- The preferred way of talking to an API is with code
- Browse https://developer.spotify.com/console/get-search-item/ to get his ID: 7uxtLjuqkJ3cnjQQuW6Cul
- Browse https://developer.spotify.com/console/get-artist-top-tracks/, fill in values and get JSON data
- Copy and paste into https://json-csv.com/
- Download as Excel - do the math (36.60705 min)
- Take-Aways:
...
- Browse https://www.tecson-data.ch/zurich/tiefenbrunnen/index.php (as probably shown on Google)
- Select “windchill”, 2.11.2018/7.11.2018 and “all values” at the very bottom
- Copy stuff into Excel by hand and calculate median
- Browse https://tecdottir.herokuapp.com/docs/#/measurements
- Enter parameters
- Copy curl string and pipe into a file
- Upload JSON and paste into https://json-csv.com/ (bonus: use matrix style)
- Download CSV, open in Excel and calculate median (don’t forget to filter unneeded dates)
...
- )
...
- First approach: Scraping data
- Second approach: Open Data Zürich / API:
- Take-Aways:
...
- Many interesting data are buried in PDFs
- Use proprietary software or Tabula to extract the data
...
- .
...