---
title: "Comparison of Approaches to Detecting Change-in-mean"
author: "Guillem Rigaill and Paul Fearnhead"
date: "10 February 2020"
output:
pdf_document: default
html_document: default
---
## Overview
This document provides extensive simulation results comparing a range of different methods for detecting change-in-mean in univariate data. The methods have been implemented in a way that should make it possible to extend the comparison to other data scenarios, new methods, or different measures of performance. Full code is available at:
https://github.com/guillemr/SomeSegSimu
The simulation study is implemented by (i) specifying details for each data scenario; (ii) specifying functions that implement each changepoint detection method; and (iii) specifying functions that take the true and estimated segmentations as input and measure the accuracy of the estimate. We first give examples of each of these types of functions, and then present results comparing the methods.
```{r setup, include=FALSE}
#rm(list=ls())
#knitr::opts_chunk$set(echo = TRUE)
options(width = 100)
library(parallel)
library(data.table)
library(ggplot2)
library(cowplot)
library(gridExtra)
library(reshape2)
## number of simu per dataset
nrepeat <- 300
mc.cores <- 45
set.seed(10022020)
```
## Specifying a Data Scenario
Each data scenario is defined by an S4 class DatasetDesc.
This class defines four features of the data scenario:
- the name of the scenario;
- a vector (bkp) that specifies the changepoints. This vector must start with 0 and end with the last data point, n;
- a vector (mu) that specifies the mean within each of the segments;
- the standard deviation of the noise (sigma).
From these we derive other relevant parameters, such as the segment lengths, the signal length and the signal itself, using the complete function.
Here is the code for our first dataset.
All other datasets are in the R file InitialDatasets.R.
```{r dataset_initial}
source("DatasetDesc.R")
initialDatasetDesc <- list()
i <- 1
initialDatasetDesc[[i]] <- new("DatasetDesc",
Name = "Dt1",
bkp = as.integer(c(0, 204, 266, 307, 471, 511, 819, 901, 1331, 1556, 1597, 1658, 2048)),
mu = c(0, 14.64, -3.66, 7.32, -7.32, 10.98, -4.39, 3.29, 19.03,7.68, 15.37, 0),
sigma = 10
)
i <- i+1
source("InitialDatasets.R")
initialDatasetDesc <- lapply(initialDatasetDesc, complete)
```
## Simulating datasets
The DatasetDesc S4 class has a function (simulation) to simulate a profile.
Three options are available for the simulation :
- i.i.d. Gaussian noise : "Gauss"
- i.i.d. Student-t noise with 10 degrees of freedom : "Stud"
- AR(1) noise with an auto-correlation of 0.3 : "ARMA"
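As a rough illustration, the three noise types could be generated as in the stand-alone sketch below. This is hypothetical: the actual generator is the simulation function in DatasetDesc.R, and the rescaling of the Student-t noise to unit variance is our assumption.

```r
## Hypothetical sketch of the three noise types; the real generator is the
## simulation function of DatasetDesc.R.
simulate_noise <- function(n, type = c("Gauss", "Stud", "ARMA")) {
  type <- match.arg(type)
  switch(type,
    "Gauss" = rnorm(n),                                  ## i.i.d. N(0, 1)
    "Stud"  = rt(n, df = 10) / sqrt(10 / 8),             ## t(10) rescaled to unit variance (assumption)
    "ARMA"  = as.numeric(arima.sim(list(ar = 0.3), n))   ## AR(1) with auto-correlation 0.3
  )
}

## a toy two-segment profile with Gaussian noise
y <- rep(c(0, 5), each = 100) + 2 * simulate_noise(200, "Gauss")
```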
Here we plot an example profile for each dataset.
```{r show, message=F, results="hide", fig.height=20}
allplots <- lapply(initialDatasetDesc, FUN=function(dataDesc){
y <- simulation(dataDesc)
dataY <- data.frame(x=1:length(y), y=y, smt=dataDesc@signal)
plot_data <- ggplot(dataY, aes(x=x, y=y)) +
geom_point(size=0.2) + geom_line(aes(x=x, y=smt, color="red")) +
geom_vline(xintercept = dataDesc@bkp, color="blue") +
theme(legend.position = "none", plot.title = element_text(hjust = 0.5)) +
ggtitle(paste0("Dataset: ", dataDesc@Name))
plot_data
})
lay <- rbind(1:2, 3:4, 5:6, rep(7, 2), rep(8, 2), rep(9, 2), 10:11)
grid.arrange(allplots[[1]], allplots[[2]],
allplots[[3]], allplots[[4]],
allplots[[5]], allplots[[6]],
allplots[[7]], allplots[[8]],
allplots[[9]], allplots[[10]],
allplots[[11]],
layout_matrix=lay)
```
## Extension of the Datasets
For each scenario we provide two ways of changing aspects of the data:
1. Varying signal strength: We can multiply the standard deviation by a factor. This is equivalent (after re-scaling) to scaling the change-in-mean at each changepoint.
2. Varying data length: We multiply each segment length by a constant factor and divide the standard deviation by the square root of this factor. This ensures that, as we vary the dataset size, the signal strength for each change is kept roughly constant.
This is done using the scaleDesc and expandDesc functions of the DatasetDesc class.
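In spirit, the two transformations amount to the following stand-alone sketch (the helper names are ours; the actual implementations are the scaleDesc and expandDesc methods):

```r
## Hypothetical sketches of the two dataset transformations.
scale_sketch <- function(bkp, mu, sigma, alpha) {
  ## multiplying sigma by alpha is equivalent (after re-scaling) to
  ## dividing each change-in-mean by alpha
  list(bkp = bkp, mu = mu, sigma = sigma * alpha)
}
expand_sketch <- function(bkp, mu, sigma, alpha) {
  list(bkp   = as.integer(round(bkp * alpha)),  ## longer segments
       mu    = mu,
       sigma = sigma / sqrt(alpha))             ## keeps per-change signal strength constant
}
```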
```{r dataset_expanding, fig.width=10, fig.height=5}
scaledDatasetDesc <- unlist(
lapply(initialDatasetDesc, FUN=function(dataDesc){
lapply(c(0.2, 0.4, 0.6, 0.8, 1, 1.2, 1.4, 1.6, 1.8),
FUN=function(alpha) scaleDesc(dataDesc, alpha))
})
)
## multiply by powers of 1.2 such that at most 3000 datapoints
expandDatasetDesc <- unlist(
lapply(initialDatasetDesc, FUN=function(dataDesc){
n <- dataDesc@Lg
coef_ <- 1.2
max.pow <- trunc(log(3000/n)/log(coef_)) ## to get something smaller than 3000
max.pow <- min(max.pow, 5)
alphas <- coef_^c((max.pow-5):max.pow)
lapply(alphas, FUN=function(alpha) expandDesc(dataDesc, alpha))
})
)
##
allGauss <- c(expandDatasetDesc, scaledDatasetDesc)
allStud <- lapply(allGauss, FUN=function(d) noiseDesc(d, "Stud"))
allARMA <- lapply(allGauss, FUN=function(d) noiseDesc(d, "ARMA"))
allNoise <- c(allGauss, allStud, allARMA)
## for each we also compute an estimate of the sd based on the successive differences
## see dataDesc code
allNoise <- lapply(allNoise, FUN=function(dataDesc) simulateMany(dataDesc, nrepeat=nrepeat))
```
## Definition of the changepoint methods
We store each changepoint method as an S4 class with a name and a function that takes the data as input and returns a set of changepoints.
Here is the code for our first changepoint method.
All others are defined in the R file InitialCptMethod.R.
Similarly, we define an S4 class for methods returning a fixed number of changepoints.
All such methods are defined in the R file InitialCptMethod_knowK.R.
We will use these methods to assess performance when the number of changes is known.
```{r cpt_method, message=F}
source("utils/all_packages_and_a_few_functions.R")
source("CptMethod.R")
listMethod <- list()
i <- 1
##################################################################################################
## PELT default and mBIC, for Yao's see Fpop
listMethod[[i]] <- new("CptMethod",
Name = "Pelt-Dft",
Fun = function(y){
w <- cpt.mean(y,method="PELT")
return(c(cpts(w), length(y)))
})
i <- i+1
source("InitialCptMethod.R") ## Methods inferring the number of changes
source("InitialCptMethod_knowK.R") ## Methods for a fixed number of changes (a smaller set)
```
Here is a list of all methods :
- "Pelt-Dft" : PELT (changepoint package) with its default parameters.
- "Fpop-Yao" : fpop (gfpop package) with the penalty of Yao 1989.
- "Rpop-Yao" : robust fpop (gfpop package) with the penalty of Yao 1989.
- "Fpop-Yao-Ha" : fpop with the penalty of Yao 1989 and the Hall estimate of the variance (rather than the MAD-based estimate).
- "Fpop-Yao-L4" : fpop with the penalty of Yao 1989 and a minimum segment length of 4.
- "Fpop-Crops-Lb" : Crops and fpop with the penalty of Lebarbier 2005. The min penalty is set to $2\log(n) - 2\log(K_{max})+3$ with $K_{max}=\min(n/2-1, 3n/\log(n))$ and the max penalty to $2\log(n)+3$.
- "Fpop-Crops-Lb-L4" : Crops and fpop with the penalty of Lebarbier 2005 and a minimum segment length of 4. The min penalty is set to $2\log(n) - 2\log(K_{max})+3$ with $K_{max}=\min(n/4-1, 3n/\log(n))$ and the max penalty to $2\log(n)+3$.
- "Fpop-Crops-Lb-Ha" : Crops and fpop with the penalty of Lebarbier 2005 and the Hall estimate of the variance.
- "Fpop-Crops-Lb-Ha-L4" : Crops and fpop with the penalty of Lebarbier 2005, the Hall estimate of the variance and a minimum segment length of 4.
- "Fpop-Crops-Lb-Rob" : Crops and fpop using the biweight loss (gfpop package) with the penalty of Lebarbier 2005.
- "Fpsn-Lb" : pDPA (jointseg package) with the penalty of Lebarbier 2005 (with a maximum number of changepoints of $3n/\log(n)$).
- "FDRseg-.1" : FDRseg with a quantile $\alpha$ of 0.1 (default). Quantiles are estimated once for each $n$ using $1000/\alpha$ Monte Carlo simulations rather than the default $50/\alpha$.
- "FDRseg-.05" : FDRseg with a quantile $\alpha$ of 0.05.
- "Bft-Tguh" : TGUH (breakfast package) with its default parameters.
- "Bft-Wbs" : WBS (breakfast package) with its default parameters.
- "Bft-Hyb" : Hybrid (TGUH/WBS) approach from the breakfast package with its default parameters.
- "Wbs-Sic" : WBS with the SIC criterion (wbs package).
- "Wbs-Th1" : WBS with a threshold of 1 (wbs package).
- "Wbs2-.9" : WBS2 with a $\lambda$ of 0.9 (WBS2 GitHub code).
- "Wbs2-.95" : WBS2 with a $\lambda$ of 0.95 (WBS2 GitHub code).
- "IDetect" : IDetect (IDetect package) with its default parameters.
- "mosum" : multiscale local pruning approach (mosum package) with its default parameters.
Here is a list of all methods for a known K :
- "MaxLik-K*" : pDPA (jointseg package) with known K.
- "Bft-Wbs-K*" : WBS (wbs package) with known K.
- "BinSeg-K*" : binary segmentation (wbs package) with known K.
- "Rpop-K*" : robust gfpop (gfpop package) with known K. In case K is larger than 100, gfpop is not run and K=1 is returned.
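For reference, the Crops penalty ranges quoted above can be computed with a small helper (hypothetical; the function and parameter names are ours):

```r
## Hypothetical helper reproducing the Crops penalty range described above.
## min.seg = 2 corresponds to "Fpop-Crops-Lb", min.seg = 4 to the -L4 variants.
crops_range <- function(n, min.seg = 2) {
  Kmax <- min(n / min.seg - 1, 3 * n / log(n))
  c(pen.min = 2 * log(n) - 2 * log(Kmax) + 3,
    pen.max = 2 * log(n) + 3)
}
crops_range(2048)  ## penalty interval searched by Crops for n = 2048
```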
## Measuring segmentation accuracy
The DatasetDesc S4 class includes a score function. It takes one argument: the output of the Run function of the CptMethod S4 class (which includes the changepoints and the runtime).
This function can be modified.
We measure :
- "mse" : the mean squared error between the true and estimated signals.
- "K" : the absolute difference between the true number of changes and the estimated number of changes.
- "ari" : 1 minus the ARI between the true and estimated segmentations.
- "time" : the runtime.
For all scores, the closer to $0$ the better the method.
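A minimal stand-alone sketch of such a score function is given below. This is hypothetical: the actual score function is defined in DatasetDesc.R, and we use a textbook ARI computation which may differ from the implementation used in the study.

```r
## Hypothetical sketch of the four scores; bkp vectors start at 0 and end at n.
ari_sketch <- function(a, b) {            ## textbook adjusted Rand index
  tab <- table(a, b)
  n2  <- choose(sum(tab), 2)
  sij <- sum(choose(tab, 2))
  si  <- sum(choose(rowSums(tab), 2))
  sj  <- sum(choose(colSums(tab), 2))
  (sij - si * sj / n2) / ((si + sj) / 2 - si * sj / n2)
}
score_sketch <- function(true.bkp, est.bkp, signal, fit, runtime) {
  true.lab <- rep(seq_len(length(true.bkp) - 1), diff(true.bkp))  ## segment labels
  est.lab  <- rep(seq_len(length(est.bkp) - 1),  diff(est.bkp))
  c(mse  = mean((signal - fit)^2),
    K    = abs(length(true.bkp) - length(est.bkp)),  ## equals the difference in number of changes
    ari  = 1 - ari_sketch(true.lab, est.lab),
    time = runtime)
}
```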
# Running all approaches
## Generating quantiles for fdrseg
For FDRseg, to speed up calculations it makes sense to generate the quantiles once.
This is what we do below.
```{r fdrseg, message=F, results='hide'}
signal.length <- sort(unique(sapply(allNoise, function(x) x@Lg)))
for(alpha in c(0.1)){
mclapply(signal.length, FUN=function(n){
file.quantile <- paste("Simu_FDRseg/fdrseg_qtl", n, "alpha=", alpha, ".RData", sep="")
if(!file.exists(file.quantile)){
r = round(1000/min(alpha, 1 - alpha)) ## default is 50 (we run this once so 1000)
qtl.fdrseg <- simulQuantile(1 - alpha, n, r, "fdrseg")
save(qtl.fdrseg, file=file.quantile)
}
}, mc.cores=mc.cores)
}
```
## Running (K unknown)
All methods are run on all datasets after an initial
scaling of the data using a difference-based estimator
of the standard deviation.
Methods are scored for their MSE, ARI, runtime and estimation of the true number of changepoints.
```{r run_score}
## checking that all packages are working
## TESTING PURPOSES
## IF FALSE = Not-Executed
if(FALSE){
lmat <- list()
for(i in 1:length(allNoise)){
print(i)
dataDesc <- allNoise[[i]]
lineScore <- list()
for(imet in 1:length(listMethod)){
cptMet <- listMethod[[imet]]
load(dataDesc@fileDt)
load(dataDesc@fileRs)
data <- toSaveDt$allDatasets[[1]]
#cptMet@Fun(data$data/data$sd.est)
output <- runMethod(object=cptMet, y=data$data/data$sd.est)
lineScore[[imet]] <- score(dataDesc, output, data$data)
}
lmat[[i]] <- do.call(rbind, lineScore)
lmat[[i]]$i <-i
lineScore <- list()
for(imet in 1:length(listMethod_knownK)){
cptMet <- listMethod_knownK[[imet]]
load(dataDesc@fileDt)
load(dataDesc@fileRs)
data <- toSaveDt$allDatasets[[1]]
#cptMet@Fun(data$data/data$sd.est)
output <- runMethodK(object=cptMet, y=data$data/data$sd.est, K = dataDesc@K)
lineScore[[imet]] <- score(dataDesc, output, data$data)
}
lmat[[i]] <- do.call(rbind, lineScore)
lmat[[i]]$i <-i
}
}
##########################################
## RUN ALL approaches
for(dataDesc in allNoise){
load(dataDesc@fileDt)
load(dataDesc@fileRs)
for(cptMet in listMethod){
## if not already done run approach
if(is.null(toSaveRs$allMethods[[cptMet@Name]])){
scoreMet <- mclapply(toSaveDt$allDatasets, FUN=function(data){
output <- runMethod(cptMet, data$data/data$sd.est)
lineScore <- score(dataDesc, output, data$data)
return(lineScore)
}, mc.cores=mc.cores)
matScore <- rbindlist(scoreMet)
matScore$index <- 1:nrow(matScore)
matScore$method <- cptMet@Name
toSaveRs$allMethods[[cptMet@Name]] <- data.frame(matScore)
save(toSaveRs, file=dataDesc@fileRs)
}
}
}
```
## Running (K known)
Methods are then scored for their MSE, ARI, runtime and estimation of the true number of changepoints.
The number of changepoints should always be correct (except for Robseg for large K*, where there are some optimisation
issues due to segment lengths).
```{r mlknownk}
## RUN ML and WBS with known K (idea measure the difference
## between getting changes and mse and getting K)
for(dataDesc in allNoise){
load(dataDesc@fileDt)
load(dataDesc@fileRs)
for(cptMet in listMethod_knownK){
## if not already done run approach
if(is.null(toSaveRs$allMethods[[cptMet@Name]])){
scoreMet <- mclapply(toSaveDt$allDatasets, FUN=function(data){
output <- runMethodK(cptMet, data$data/data$sd.est, toSaveDt$description@K)
lineScore <- score(dataDesc, output, data$data)
return(lineScore)
}, mc.cores=mc.cores)
matScore <- rbindlist(scoreMet)
matScore$index <- 1:nrow(matScore)
matScore$method <- cptMet@Name
toSaveRs$allMethods[[cptMet@Name]] <- data.frame(matScore)
save(toSaveRs, file=dataDesc@fileRs)
}
}
}
```
# Results
## A rough summary
We start with a fairly rough overview of the results.
Let $Score_{mdi}^{(c)}$ denote the score of method $m$ for criterion $c$, on dataset $d$ and replicate $i$.
Here we consider that a method $m$ is better than a method $m'$ in terms of a score $c$ if
the number of times $m$ is at least as good as $m'$ is larger than the number of times
$m'$ is at least as good as $m$. We then report how often $m$ is better than any $m'$.
In detail :
1) we count the number of times an approach is at least as good as another one in terms of MSE, ARI or K. Defining
$\mathbf{1}$ as the indicator function, this
is $$S(m, m') = \sum_d \sum_i \mathbf{1}(Score_{mdi}^{(c)} \leq Score_{m'di}^{(c)})$$
2) for each method $m$ we report (as a percentage : percAE) the average of $S(m, m')$ : $\frac{100}{M-1} \sum_{m'} S(m, m'),$ where $M$ is the number of methods.
3) we also count how many times $S(m, m')$ is strictly larger than $S(m',m)$ and rank approaches based on this count.
Below we report this for the Gauss, Student and ARMA simulations.
The closer the final score is to 100, the better the approach is in terms of the criterion.
The methods are sorted from smallest to largest based on the final MSE score.
Unsurprisingly, methods knowing $K$ have better results and are
at the bottom of the table.
```{r counting_per_dataset}
Noises <- c("Gauss", "Stud", "ARMA")
score.Name <- c("ari", "mse", "K")
allUnknown <- c("Pelt-Dft", "Fpop-Yao", "Fpop-Yao-Ha", "Fpop-Yao-L4",
"Fpop-Crops-Lb", "Fpop-Crops-Lb-L4", "Fpop-Crops-Lb-Ha",
"Fpop-Crops-Lb-Ha-L4", "Fpop-Crops-Lb-Rob", "Fpsn-Lb",
"FDRseg-.1", "Bft-Tguh", "Bft-Hyb", "Wbs-Sic", "Wbs-Th1",
"Wbs2-.9", "Wbs2-.95", "IDetect", "mosum", "Rpop-Yao")
## additionnal Bft-Wbs and FDRseg-.05
someKnown <- c("Pelt-Dft", "Fpop-Yao", "Fpop-Crops-Lb", "Fpop-Crops-Lb-L4", "FDRseg-.1",
"Bft-Tguh", "Bft-Wbs", "Wbs2-.9", "Wbs2-.95",
"IDetect", "mosum", "Rpop-Yao")
minSizeKnown <- c("Fpop-Yao-L4", "Fpop-Crops-Lb-L4",
"Fpop-Crops-Lb-Ha-L4", "IDetect", "mosum")
allKnown <- c("MaxLik-K*", "Bft-Wbs-K*", "BinSeg-K*", "Rpop-K*")
## COMPARE
getCompareMat <- function(i_s, i_n, subsetMethods){
files <- list.files("simulated_dataset/Res/", pattern=Noises[i_n], full.names = T)
allScores <- lapply(files, function(file_){
load(file_)
dat <- sapply(toSaveRs$allMethods, FUN=function(x) x[[score.Name[i_s]]])
dat <- dat[, order(colnames(dat))]
dat <- dat[, colnames(dat) %in% subsetMethods]
dat
})
matAllScores <- do.call(rbind, allScores)
compareMat <- matrix(nr=ncol(matAllScores), nc=ncol(matAllScores))
## atleast as Good (less than mse, ari, k...)
rownames(compareMat) <- colnames(matAllScores)
colnames(compareMat) <- colnames(matAllScores)
for(i in 1:ncol(compareMat)){
for(j in 1:ncol(compareMat)){
compareMat[i, j] <- mean(matAllScores[, i] <= matAllScores[, j], na.rm=T)
}
}
return(compareMat)
}
## VOTE
getVote <- function(i_s, i_n, subsetMethods){
essai <- getCompareMat(i_s, i_n, subsetMethods)
countV <- (essai >= t(essai))
diag(countV) <- F
diag(essai) <- 0
#round(100*rowSums(countV)/(nrow(countV)-1), 1)
# ranking with ex-aequo
data.frame(rank=match(rowSums(countV), sort(rowSums(countV), decreasing = T)),
percAE=round(100*rowSums(essai)/(nrow(countV)-1)))
}
```
For all methods not knowing K.
```{r}
## AT LEAST AS GOOD
## Gauss All Unknown K
perc <- do.call(cbind, lapply(1:3, FUN=getVote, i_n=1, subsetMethods=allUnknown))
colnames(perc) <- paste(rep(score.Name, each=2), colnames(perc), sep="-")
perc[order(perc[, 3]), ]
## Stud All Unknown K
perc <- do.call(cbind, lapply(1:3, FUN=getVote, i_n=2, subsetMethods=allUnknown))
colnames(perc) <- paste(rep(score.Name, each=2), colnames(perc), sep="-")
perc[order(perc[, 3]), ]
## ARMA All Unknown K
perc <- do.call(cbind, lapply(1:3, FUN=getVote, i_n=3, subsetMethods=allUnknown))
colnames(perc) <- paste(rep(score.Name, each=2), colnames(perc), sep="-")
perc[order(perc[, 3]), ]
```
For a subset of methods not knowing K.
```{r}
## AT LEAST AS GOOD
## Gauss Subset Unknown K
perc <- do.call(cbind, lapply(1:3, FUN=getVote, i_n=1, subsetMethods=someKnown))
colnames(perc) <- paste(rep(score.Name, each=2), colnames(perc), sep="-")
perc[order(perc[, 3]), ]
## Stud Subset Unknown K
perc <- do.call(cbind, lapply(1:3, FUN=getVote, i_n=2, subsetMethods=someKnown))
colnames(perc) <- paste(rep(score.Name, each=2), colnames(perc), sep="-")
perc[order(perc[, 3]), ]
## ARMA Subset Unknown K
perc <- do.call(cbind, lapply(1:3, FUN=getVote, i_n=3, subsetMethods=someKnown))
colnames(perc) <- paste(rep(score.Name, each=2), colnames(perc), sep="-")
perc[order(perc[, 3]), ]
```
For all methods knowing K.
```{r}
## AT LEAST AS GOOD
## Gauss All known K
perc <- do.call(cbind, lapply(1:3, FUN=getVote, i_n=1, subsetMethods=allKnown))
colnames(perc) <- paste(rep(score.Name, each=2), colnames(perc), sep="-")
perc[order(perc[, 3]), ]
## Stud All known K
perc <- do.call(cbind, lapply(1:3, FUN=getVote, i_n=2, subsetMethods=allKnown))
colnames(perc) <- paste(rep(score.Name, each=2), colnames(perc), sep="-")
perc[order(perc[, 3]), ]
## ARMA All known K
perc <- do.call(cbind, lapply(1:3, FUN=getVote, i_n=3, subsetMethods=allKnown))
colnames(perc) <- paste(rep(score.Name, each=2), colnames(perc), sep="-")
perc[order(perc[, 3]), ]
```
## A rough summary per dataset
Here we report the score of the previous section, computed per simulated dataset.
That is, again we consider that a method $m$ is better than a method $m'$ in terms of a score $c$ if
the number of times $m$ is at least as good as $m'$ is larger than the number of times
$m'$ is at least as good as $m$. In total there are 495 simulated datasets.
We split those in 3 according to the noise type (Gauss, Student or ARMA)
and represent the final score for MSE and K graphically.
Datasets are grouped according to the 11 initial datasets and sorted by the
number of changes.
```{r per_dataset, fig.width=16, fig.height=12, cache=FALSE}
## COMPARE PER DATASET
getCompareMat_perDt <- function(i_s, i_n, subsetMethods){ ## i_s = ARI:1, MSE:2, K:3, i_n = Gauss:1, Stud:2, ARMA:3
files <- list.files("simulated_dataset/Res/", pattern=Noises[i_n], full.names = T)
allAtLeastAsGood <- lapply(files, function(file_){
load(file_)
dat <- sapply(toSaveRs$allMethods, FUN=function(x) x[[score.Name[i_s]]])
dat <- dat[, order(colnames(dat))]
dat <- dat[, colnames(dat) %in% subsetMethods]
compareMat <- matrix(nr=ncol(dat), nc=ncol(dat))
for(i in 1:ncol(compareMat)){
for(j in 1:ncol(compareMat)){
compareMat[i, j] <- sum(dat[, i] <= dat[, j], na.rm=T)
}
}
colnames(compareMat) <- rownames(compareMat) <- colnames(dat)
data.frame(Method = rownames(compareMat), score = rowSums(compareMat) - diag(compareMat), K=toSaveRs$description@K, n=toSaveRs$description@Lg,
dataset=toSaveRs$description@Name, sigma=toSaveRs$description@sigma)
})
data <- do.call(rbind, allAtLeastAsGood)
data$scoreN <- data$score / (nrepeat*(nlevels(data$Method)-1)) *100
subdata <- data[match(unique(data$dataset), data$dataset), ]
subdata$type <- "s"; subdata$type[grep("_x_", subdata$dataset)] <- "x"
subdata <- subdata[order(subdata$K, subdata$n, subdata$sigma), ]
data$Name <- factor(data$dataset, levels=as.vector(subdata$dataset))
## plotting
## order first kstar approaches
istar <- grep("K\\*", levels(data$Method))
if(length(istar)>0){
inotstar <- (1:nlevels(data$Method))[-c(istar)]
data$Method <- factor( data$Method, levels = levels(data$Method)[c(istar, inotstar)])
}
p <- ggplot(data, aes(y = Method, x = Name)) +
geom_raster(aes(fill=scoreN)) +
scale_fill_gradient2(low="blue", mid="white", high="red", midpoint = 50) +
labs(x="Method", y="Data", title="Matrix") +
theme_bw() + theme(axis.text.x=element_text(size=3, angle=90, vjust=0.3),
axis.text.y=element_text(size=15),
plot.title=element_text(size=11))
p
}
```
For all methods not knowing K.
```{r, fig.width=16, fig.height=12}
## GAUSS
all <- lapply(1:3, FUN=getCompareMat_perDt, i_n=1, subsetMethods=allUnknown)
grid.arrange(#all[[1]]+ggtitle("Gauss - ARI"),
all[[2]]+ggtitle("Gauss - MSE"),
all[[3]]+ggtitle("Gauss - K"),
nrow=2, ncol=1)
##################################################################
## STUD
all <- lapply(1:3, FUN=getCompareMat_perDt, i_n=2, subsetMethods=allUnknown)
grid.arrange(#all[[1]]+ggtitle("Student - ARI"),
all[[2]]+ggtitle("Student - MSE"),
all[[3]]+ggtitle("Student - K"),
nrow=2, ncol=1)
##################################################################
## ARMA
all <- lapply(1:3, FUN=getCompareMat_perDt, i_n=3, subsetMethods=allUnknown)
grid.arrange(#all[[1]]+ggtitle("ARMA - ARI"),
all[[2]]+ggtitle("ARMA - MSE"),
all[[3]]+ggtitle("ARMA - K"),
nrow=2, ncol=1)
```
For a subset of methods not knowing K.
```{r, fig.width=16, fig.height=12}
## GAUSS
all <- lapply(1:3, FUN=getCompareMat_perDt, i_n=1, subsetMethods=someKnown)
grid.arrange(#all[[1]]+ggtitle("Gauss - ARI"),
all[[2]]+ggtitle("Gauss - MSE"),
all[[3]]+ggtitle("Gauss - K"),
nrow=2, ncol=1)
##################################################################
## STUD
all <- lapply(1:3, FUN=getCompareMat_perDt, i_n=2, subsetMethods=someKnown)
grid.arrange(#all[[1]]+ggtitle("Student - ARI"),
all[[2]]+ggtitle("Student - MSE"),
all[[3]]+ggtitle("Student - K"),
nrow=2, ncol=1)
##################################################################
## ARMA
all <- lapply(1:3, FUN=getCompareMat_perDt, i_n=3, subsetMethods=someKnown)
grid.arrange(#all[[1]]+ggtitle("ARMA - ARI"),
all[[2]]+ggtitle("ARMA - MSE"),
all[[3]]+ggtitle("ARMA - K"),
nrow=2, ncol=1)
```
## A few datasets
```{r a_few_datasets, echo=F, fig.width=12, fig.height=12, message=F, cache=FALSE}
source("getPlotDatasetNoise.R")
mName_ <- c("Fpop-Yao", "FDRseg-.1", "Fpop-Crops-Lb-Ha-L4", "Wbs-Sic", "mosum", "IDetect")
i_datasets <- c(1, 3, 4, 5)
noises <- c("Gaus", "Stud", "ARMA")
res <- lapply(i_datasets, FUN=function(i_d){
lapply(noises, FUN=getPlotDatasetNoise, append_="x", dataset=initialDatasetDesc[[i_d]], mName_=mName_)
})
lay <- matrix(1:20, nrow=5, ncol=4, byrow=T)
grid.arrange(res[[1]][[1]]$data.ex, res[[2]][[1]]$data.ex, res[[3]][[1]]$data.ex, res[[4]][[1]]$data.ex,
#allP[[1]]$legend,
res[[1]][[1]]$mse, res[[2]][[1]]$mse, res[[3]][[1]]$mse, res[[4]][[1]]$mse,
res[[1]][[2]]$mse, res[[2]][[2]]$mse, res[[3]][[2]]$mse, res[[4]][[2]]$mse,
res[[1]][[3]]$mse, res[[2]][[3]]$mse, res[[3]][[3]]$mse, res[[4]][[3]]$mse,
res[[1]][[1]]$legend,
layout_matrix=lay)
```
## Detailed overview for 7 approaches
Here, for each of the 11 initial datasets, we report the results
of 7 approaches. For each initial dataset there are two scenarios:
1. one where the size of the data is varied and the standard deviation varies with $\frac{1}{\sqrt{n}}$;
2. one where the variance is varied and $n$ is kept constant.
We plot the mean ARI, mean MSE, mean ability to recover the true number of changepoints and mean runtime.
The results are reported as a function of $n$ or $\sigma$.
```{r ploting_scenario_unknownK, echo=F, fig.width=12, fig.height=12, message=F, cache=FALSE}
source("getPlotDatasetNoise.R")
mName_ <- c("Fpop-Yao", "Fpop-Crops-Lb-Ha", "Fpop-Crops-Lb-Ha-L4", "Wbs2-.95", "mosum", "IDetect")
plotDataSet <- function(dataset, mName_){
noises <- c("Gaus", "Stud", "ARMA")
allP <- lapply(noises, FUN=getPlotDatasetNoise, append_="x", dataset=dataset, mName_=mName_)
lay <- rbind(c(1,1,2), 3:5, 6:8) #, 9:11, 12:14)
grid.arrange(allP[[1]]$data.ex, allP[[1]]$legend,
allP[[1]]$mse, allP[[2]]$mse, allP[[3]]$mse,
#allP[[1]]$ari, allP[[2]]$ari, allP[[3]]$ari,
allP[[1]]$k, allP[[2]]$k, allP[[3]]$k,
#allP[[1]]$time, allP[[2]]$time, allP[[3]]$time,
layout_matrix=lay)
allP <- lapply(noises, FUN=getPlotDatasetNoise, append_="s", dataset=dataset, mName_=mName_)
grid.arrange(allP[[1]]$data.ex, allP[[1]]$legend,
allP[[1]]$mse, allP[[2]]$mse, allP[[3]]$mse,
#allP[[1]]$ari, allP[[2]]$ari, allP[[3]]$ari,
allP[[1]]$k, allP[[2]]$k, allP[[3]]$k,
#allP[[1]]$time, allP[[2]]$time, allP[[3]]$time,
layout_matrix=lay)
}
for(i_d in 1:length(initialDatasetDesc)){
########################################################################################################
########################################################################################################
plotDataSet(initialDatasetDesc[[i_d]], mName_ = mName_);
}
```
## Detailed overview for 4 approaches knowing K
Here, for each of the 11 initial datasets, we report the results
of 4 approaches knowing K. For each initial dataset there are two scenarios:
1. one where the size of the data is varied and the standard deviation varies with $\frac{1}{\sqrt{n}}$;
2. one where the variance is varied and $n$ is kept constant.
We plot the mean ARI, mean ability to recover the true number of changepoints, mean MSE and mean runtime.
The results are reported as a function of $n$ or $\sigma$.
```{r ploting_scenario_knownK, echo=F, fig.width=12, fig.height=12, message=F, cache=FALSE}
source("getPlotDatasetNoise.R")
mName_ <- c("MaxLik-K*", "Bft-Wbs-K*", "BinSeg-K*", "Rpop-K*")
plotDataSet <- function(dataset, mName_){
noises <- c("Gaus", "Stud", "ARMA")
allP <- lapply(noises, FUN=getPlotDatasetNoise, append_="x", dataset=dataset, mName_=mName_)
lay <- rbind(c(1,1,2), 3:5, 6:8) #, 9:11)
grid.arrange(allP[[1]]$data.ex, allP[[1]]$legend,
allP[[1]]$mse, allP[[2]]$mse, allP[[3]]$mse,
#allP[[1]]$ari, allP[[2]]$ari, allP[[3]]$ari,
allP[[1]]$k, allP[[2]]$k, allP[[3]]$k, ## sanity check
layout_matrix=lay)
allP <- lapply(noises, FUN=getPlotDatasetNoise, append_="s", dataset=dataset, mName_=mName_)
grid.arrange(allP[[1]]$data.ex, allP[[1]]$legend,
allP[[1]]$mse, allP[[2]]$mse, allP[[3]]$mse,
#allP[[1]]$ari, allP[[2]]$ari, allP[[3]]$ari,
allP[[1]]$k, allP[[2]]$k, allP[[3]]$k,
layout_matrix=lay)
}
for(i_d in 1:length(initialDatasetDesc)){
########################################################################################################
########################################################################################################
plotDataSet(initialDatasetDesc[[i_d]], mName_ = mName_);
}
```