Different results in R's `dtw` and Python's `dtw-python`

Question

I am exploring some alternatives to compute Dynamic Time Warping (DTW) distances in Python. And I am struggling with the limited documentation about the packages I am finding...

I have explored dtaidistance's distance_fast and distance; fastdtw's fastdtw and dtw-python's dtw functions. They are leading to different results... I am sure it is related to some default parameters that sometimes are not so clear in the documentation, but I was not able to find any explanations about...

But my question at this moment here is about dtw-python. Specifically, a comparison of the Python's dtw-python package results with the R's dtw package results.

They are direct equivalent each other, as indicated here. And the major function documentation is very similar. So it is really weird...

The R code:

data=read.table('synthetic_control.data',header=FALSE)
library(dtw)
dtw(x=data[1,], y=data[2,] ,
               window.type= "sakoechiba", window.size= 5)$distance
dtw(x=data[2,], y=data[3,] ,
               window.type= "sakoechiba", window.size= 5)$distance
dtw(x=data[2,], y=data[600,],
               window.type= "sakoechiba", window.size= 5)$distance
dtw(x=data[1,], y=data[600,],
               window.type= "sakoechiba", window.size= 5)$distance
dtw(x=data[3,], y=data[600,],
               window.type= "sakoechiba", window.size= 5)$distance

Results:

[1] 42.181 [1] 35.09292
[1] 105.0999
[1] 110.9285
[1] 105.9934

(Edit: the above values are Euclidian distances)

The Python code:

import pandas as pd
data =  pd.read_csv("synthetic_control.data", 
                    header=None,delimiter=r"\s+")
from dtw import *
print(dtw(x=data_a.loc[0], y=data_a.loc[1] ,
               window_args= {"window_type": "sakoechiba", "window_size": 5},
               keep_internals=True).distance)
print(dtw(data_a.loc[1], data_a.loc[2] ,
               window_args= {"window_type": "sakoechiba", "window_size": 5},
               keep_internals=True).distance)
print(dtw(data_a.loc[1], data_a.loc[599] ,
               window_args= {"window_type": "sakoechiba", "window_size": 5},
               keep_internals=True).distance)
print(dtw(data_a.loc[0], data_a.loc[599] ,
               window_args= {"window_type": "sakoechiba", "window_size": 5},
               keep_internals=True).distance)
print(dtw(data_a.loc[2], data_a.loc[599] ,
               window_args= {"window_type": "sakoechiba", "window_size": 5},
               keep_internals=True).distance)

Results:

166.58120000000002 165.87640000000005 647.89322 604.29862 663.3971200000001

The example above use synthetic dataset available here, and each row represent a time serie.

As pointed above, the intention was to set the same parameters (windows type, windows size)... and the defaults appears to be the same for both python and R... What am I missing here?

Thanks in advance,

Different results in R's `dtw` and Python's `dtw-python`

Answers (1)

Related Questions

Different results in R&#39;s `dtw` and Python&#39;s `dtw-python`

Answers (1)

Related Questions

Different results in R's `dtw` and Python's `dtw-python`