Reputation: 123
I'm using criterion to benchmark a function. The end goal is to create a box plot for different inputs to this function, but as far as I understand criterion, its output isn't enough for that.
Example bench output from $ cabal bench:
benchmarking...
measurement took 6.317 s
analysing with 1000 resamples
bootstrapping with 7 of 7 samples (100%)
time 174.5 ms (172.7 ms .. 177.0 ms)
1.000 R² (1.000 R² .. 1.000 R²)
mean 176.0 ms (175.0 ms .. 177.2 ms)
std dev 1.524 ms (993.5 μs .. 1.943 ms)
variance introduced by outliers: 12% (moderately inflated)
Looking at the docs, I thought I could get all measured times by using benchmarkWith along with a Config instead of just benchmark, but I'm missing something.
My current attempt:
import Criterion
import Criterion.Main
import Criterion.Types
import Data.Vector (Vector)
import qualified Data.Vector as V

config :: Config
config = defaultConfig
  { csvFile     = Just "benchmark.csv"
  , verbosity   = Verbose
  , rawDataFile = Just "rawbench.bin"
  , template    = "template.txt"
  }

main :: IO ()
main = do
  report <- benchmarkWith' config $ nf (fun x) z
  let a = reportMeasured report :: Vector Measured
  let b = fmap measTime a :: Vector Double
  writeFile "times.txt" $ show b
csvFile gives the values from the example output in .csv format, not all of the measured data. I don't yet know what the rawDataFile or the template look like, as they aren't created. The Vector obtained by mapping measTime over the result of reportMeasured includes times that are far off from any of the times in the ranges shown in the analysis, so I'm unsure what to do with it. They seem to be saved as the total elapsed time, and not the elapsed time for each iteration:
b = [0.1781445460001123,0.3514027309975063,0.5219033929970465,0.7031883550007478,0.8739448289998109,1.049261161002505,1.2441970670006413]
If we adjust for this via
let c = V.zipWith (-) b (V.cons 0 (V.init b))
we get
c = [0.17789985700073885,0.17817427000045427,0.17363899500196567,0.16757315399809158,0.17504766799902427,0.18031946599876392,0.17502718700052355]
which seems more plausible. Some values might be outside the ranges from the analysis, but at least they're no longer 3x the maximum.
Is it possible to acquire the full list of measured times, and is the above adjustment correct? It seems the number of saved times isn't the number of resamples; those may only be used for the analysis. Is it possible to adjust the number of samples?
Upvotes: 2
Views: 108
Reputation: 50864
Because timing a single evaluation of a fast-running function isn't very reliable, Criterion works by benchmarking your function in a loop. It starts by testing a loop with a single iteration, and then starts increasing the number of iterations -- by 1 for a while, and then by larger increments. The number of iterations used for a particular measurement is stored in the measIters field of the Measured value.
So, the increasing times you're seeing are not the result of Criterion reporting total accumulated time. Instead, they are the result of Criterion increasing the number of iterations on subsequent tests. Because your function is relatively slow, and the default configuration sets the total time limit for benchmarking a single function at 5 seconds, it looks like Criterion increases the number of iterations from 1 up to 7 before the total time spent benchmarking exceeds the time limit, so it never gets around to increasing the number of iterations by anything more than 1. Your calculation of differences "works", but only because you're looking at the difference between, say, running 5 iterations of your function and running 4 iterations of your function.
You can use the rescale function from Criterion.Types (a re-export from Criterion.Measurement.Types in the criterion-measurement package) to "adjust" a Measured value for the number of iterations. This basically divides all the statistics by the number of iterations, so your measTime will be the time per iteration.
Also, the "samples" and "resamples" can be confusing, but what's happened here is that your function has (probably) been evaluated 1+2+3+4+5+6+7=28 times total (averaging roughly 175ms per evaluation for a total of 4.9 seconds, just under the 5 second time limit -- the 6.317 seconds reported in your output probably includes some Criterion overhead). These 28 evaluations have only provided 7 samples (i.e., the 7 Measured values in your Report), corresponding with an initial 1-iteration sample through to the final 7-iteration sample. The resulting 7 mean evaluation times after rescale have then been resampled 1000 times in a statistical process called bootstrapping, in order to generate confidence intervals for the mean and standard deviation of the evaluation time. Note that these resamples don't represent any actual data collection, so they aren't useful to you in constructing a boxplot.
That's how Criterion works.
Unfortunately, you can't extract a useful boxplot of evaluation times from the data that results from this approach. You can certainly generate a boxplot of something, by upping the number of samples (i.e., increasing the timeLimit in your Config) and using rescale to generate times that relate to a single evaluation, but the distribution of the resulting collection of numbers is not the distribution of times for a single function evaluation (because the distribution of a mean calculated over multiple iterations is not the distribution of the time for a single iteration, cf. Sampling distribution), so the resulting boxplot doesn't represent any meaningful underlying distribution.
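For reference, that knob is the timeLimit field (in seconds), e.g.:

config :: Config
config = defaultConfig { timeLimit = 30 }   -- default is 5 seconds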
In your case, I think you are trying to produce a boxplot of your function's performance over a set of randomized inputs, right? If so, the "correct" but time-consuming approach is to use Criterion to run one benchmark per "random" input, ignore all the individual samples, and just use the best estimate of the function's runtime for that input, namely the indicated time below (ignoring all the other numbers in the table):
time 174.5 ms (172.7 ms .. 177.0 ms)
^^^^^^^^
1.000 R² (1.000 R² .. 1.000 R²)
mean 176.0 ms (175.0 ms .. 177.2 ms)
std dev 1.524 ms (993.5 μs .. 1.943 ms)
variance introduced by outliers: 12% (moderately inflated)
Once you have a collection of such times over, say, 30 random benchmarked inputs, you can produce a boxplot of those times, and you'll get a sensible boxplot of the distribution of execution times for your function over random inputs, where variation from sources other than the changing input (e.g., random garbage collection, competing processes in a multiprocessing system, cosmic rays, etc.) has been minimized through Criterion's extensive sampling and post-processing of results.
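A rough sketch of that per-input approach (again untested; slowFunction and the input list are placeholders for your own function and random inputs; I'm pulling the point estimate from anMean rather than digging the OLS "time" figure out of anRegress, which for a well-behaved benchmark is very close; and estPoint lives in Statistics.Types from the statistics package, which you may need to add as a dependency):

import Criterion (benchmarkWith', nf)
import Criterion.Main (defaultConfig)
import Criterion.Types (Report (..), SampleAnalysis (..))
import Statistics.Types (estPoint)

-- Stand-in for your own function; replace with `fun x` as appropriate.
slowFunction :: Int -> Integer
slowFunction n = sum [1 .. fromIntegral n]

-- One full Criterion run per input; keep only the point estimate.
timeForInput :: Int -> IO Double
timeForInput z = do
  report <- benchmarkWith' defaultConfig (nf slowFunction z)
  pure (estPoint (anMean (reportAnalysis report)))

main :: IO ()
main = do
  -- Illustrative inputs; substitute your own randomized inputs.
  let inputs = [1000000, 2000000 .. 30000000]
  times <- mapM timeForInput inputs
  -- These per-input estimates are what you'd feed to the boxplot.
  writeFile "boxplot-times.txt" (show times)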
Upvotes: 1