+ - 0:00:00
Notes for current slide
Notes for next slide

Visualizing probability distributions across
bivariate cyclic temporal granularities


Sayani Gupta
Sayani07     SayaniGupta07
https://sayanigupta-ows2020.netlify.com/


Time series analysis and econometrics
Bernoulli-IMS One World Symposium 2020
August 24, 2020 - August 28, 2020

1 / 14



Joint work with Professor Dianne Cook and Rob Hyndman

Department of Econometrics and Business Statistics
Monash University
Sayani.Gupta@monash.edu


1 / 14

Motivation: Smart meter data for households

2 / 14

Linear to cyclic

  • Cyclic time granularities: exploring different periodicities e.g. hour-of-day, day-of-month or hour-of-week, day-of-semester

  • Multiple observations for each level of cyclic granularity

  • summarize distribution of measured variables

3 / 14

Summarize distribution

  • Cyclic time granularities: exploring different periodicities e.g. hour-of-day, day-of-month or hour-of-week, day-of-semester

  • Multiple observations for each level of cyclic granularity

  • summarize distribution of measured variables

4 / 14

Data structure and graphical mapping



  • extension of tsibble data structure
  • choose any two cyclic granularities: Ci={A1,A2,,AK} and Cj={B1,B2,,BL}
  • graphical mapping (Ci,Cj,v)

  • NCP2 displays

5 / 14

Relationship of cyclic granularities

Clashes: pairs leading to empty sets

Harmonies: pairs leading to no empty sets

Still too many harmonies for display for large NC

6 / 14

Selecting harmonies with significant variation

  • significant differences between categories:
    (a) no (b) yes
  • Gestalt theory
  • efficiently capture within and between group variation
  • compute threshold
7 / 14

Rank selected harmonies
Median Maximum Pairwise Distance (MMPD)

  • rank harmonies (b) > (a)
  • distances between probability distribution through Jensen-Shannon divergences
  • normalizing constants through Fisher–Tippett–Gnedenko theorem
8 / 14

R package gravitas

  • construction of cyclic granularities and cyclic calendar algebra
  • check feasibility through harmony/clash
  • select significant harmonies and rank them
  • recommend prospective probability distributions
9 / 14

An example : Electricity smart meter data

Data source : Department of the Environment and Energy, Australia

## # A tsibble: 293,874 x 3 [30m] <UTC>
## # Key: customer_id [10]
## customer_id reading_datetime general_supply_kwh
## <chr> <dttm> <dbl>
## 1 10006414 2012-02-10 08:00:00 0.141
## 2 10006414 2012-02-10 08:30:00 0.088
## 3 10006414 2012-02-10 09:00:00 0.078
## 4 10006414 2012-02-10 09:30:00 0.151
## 5 10006414 2012-02-10 10:00:00 0.146
## 6 10006414 2012-02-10 10:30:00 0.077
## 7 10006414 2012-02-10 11:00:00 0.052
## 8 10006414 2012-02-10 11:30:00 0.055
## 9 10006414 2012-02-10 12:00:00 0.055
## 10 10006414 2012-02-10 12:30:00 0.252
## # … with 293,864 more rows
10 / 14

Cyclic granularities considered for analysis

## [1] "hour_day" "hour_week" "hour_month" "day_week" "day_month"
## [6] "week_month" "wknd_wday"



Number of displays to analyze for exhaustive exploration

There are 7P2=42 pair of granularities to look at placing one on the x-axis and the other on facet.

11 / 14

Select only harmonies (16 out of 42 selected)

facet x-axis facet levels x levels MMPD
day_week day_month 7 31 0.064
wknd_wday day_month 2 31 0.060
wknd_wday hour_day 2 24 0.044
day_week hour_day 7 24 0.024
week_month hour_day 5 24 0.023
hour_day day_month 24 31 0.016
day_month wknd_wday 31 2 0.014
day_month day_week 31 7 0.011
day_month hour_day 31 24 0.009
hour_day wknd_wday 24 2 0.009
week_month wknd_wday 5 2 0.007
day_week week_month 7 5 0.004
wknd_wday week_month 2 5 0.003
hour_day week_month 24 5 0.003
week_month day_week 5 7 0.001
hour_day day_week 24 7 0.001
12 / 14

Select and rank significant harmonies (6 out of 16 selected)

13 / 14

Thank you

Slides: https://sayanigupta-ows2020.netlify.app/

Materials: https://github.com/Sayani07/OWS2020

Package (CRAN): v0.1.3

Package (Github) : https://github.com/Sayani07/gravitas


Slides created with Rmarkdown, knitr, xaringan, xaringanthemer

14 / 14



Joint work with Professor Dianne Cook and Rob Hyndman

Department of Econometrics and Business Statistics
Monash University
Sayani.Gupta@monash.edu


1 / 14
Paused

Help

Keyboard shortcuts

, , Pg Up, k Go to previous slide
, , Pg Dn, Space, j Go to next slide
Home Go to first slide
End Go to last slide
Number + Return Go to specific slide
b / m / f Toggle blackout / mirrored / fullscreen mode
c Clone slideshow
p Toggle presenter mode
t Restart the presentation timer
?, h Toggle this help
Esc Back to slideshow