Benchmark¶
A Performance Comparison to PyEMMA (and deeptime)¶
The following comparison shows that the performance of msmhelper is competitive with that of PyEMMA. Keep in mind that while both packages focus on the construction and analysis of MSMs, PyEMMA offers much more functionality, and the two packages approach Markov state modeling with different philosophies. Nevertheless, in the following we compare a few functions that both packages provide. We ignore the linear algebra methods, since both packages rely on numpy.
A comparison to PyEMMA's successor, deeptime, has been added as well. Note, however, that while PyEMMA's focus was on MSM analysis, deeptime's focus is on modern machine-learning analysis. The comparison should therefore be read not as a strict benchmark but as a rough baseline.
In general, performance depends very much on the architecture and Python/package versions. Therefore, it is best to look at the benchmark results on your own device. To do this, you can simply download the Jupyter Notebook from the icon above.
# importing packages needed for benchmark
import deeptime
import msmhelper as mh
import numpy as np
import pyemma
import prettypyplot as pplt
from matplotlib import pyplot as plt
# change matplotlib default style
pplt.use_style(figsize=(6, 2), latex=False)
# method to visualize results
def visualize_benchmark_results(times, labels, title):
    fig, ax = plt.subplots()
    times = np.array([t.average for t in times])
    bar = ax.bar(labels, 1 / times, color=plt.colormaps['paula'].colors)
    # highlight relative performance
    ax.bar_label(
        bar,
        labels=[f'x{fac:.1f}' for fac in times[0] / times],
    )
    ax.set_ylabel(r'runs per second [1/s]')
    ax.set_ylim(np.min(1 / times) / 3, np.max(1 / times) * 3)
    ax.set_yscale('log')
    ax.set_title(title)
def generate_traj(n_steps, n_states):
    """Generate random state trajectory."""
    return np.random.randint(low=1, high=n_states + 1, size=n_steps)
Estimation of a Markov State Model¶
# create random trajectory
n_steps = int(1e5)
n_states = 10
lagtime = 100
With numba, the source code is compiled just-in-time on first use. Hence, we run each function once in advance to measure the performance without compilation time. For further information, see the numba docs.
traj = generate_traj(n_steps, n_states)
tmp = mh.msm.estimate_markov_model(traj, lagtime=lagtime)
tmp = mh.msm.implied_timescales(traj, [lagtime], ntimescales=2)
tmp = mh.msm.ck_test(traj, [lagtime], tmax=1000)
tmp = mh.msm.timescales.propagate_MCMC(traj, lagtime, 100)
In the following, we compare the determination of the Markov State Model from a numpy state trajectory.
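To make clear what is being benchmarked, here is a minimal plain-numpy sketch of the underlying estimate: count sliding-window transitions at the given lagtime and row-normalize. It assumes a zero-based integer trajectory and enforces no reversibility; the function name is illustrative and not part of either package.

```python
import numpy as np


def estimate_transition_matrix(traj, lagtime):
    """Row-normalized transition matrix from a 0-based state trajectory."""
    n_states = traj.max() + 1
    counts = np.zeros((n_states, n_states))
    # sliding-window count of transitions i -> j separated by lagtime
    np.add.at(counts, (traj[:-lagtime], traj[lagtime:]), 1)
    return counts / counts.sum(axis=1, keepdims=True)


traj = np.array([0, 0, 1, 1, 0, 1, 1, 1, 0, 0])
T = estimate_transition_matrix(traj, lagtime=1)
print(T)  # each row sums to 1
```

Both packages perform more work on top of this (connectivity checks, optional reversible estimation), which is part of what the timings below reflect.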
%%timeit -r 10 -n 1 -q -o traj = generate_traj(n_steps, n_states)
msm = mh.msm.estimate_markov_model(traj, lagtime=lagtime)
<TimeitResult : 2.51 ms ± 93 µs per loop (mean ± std. dev. of 10 runs, 1 loop each)>
time_msm_mh_traj = _
%%timeit -r 10 -n 1 -q -o traj = generate_traj(n_steps, n_states)
pyemma.msm.estimate_markov_model(traj, lag=lagtime, reversible=False)
<TimeitResult : 8.67 ms ± 531 µs per loop (mean ± std. dev. of 10 runs, 1 loop each)>
time_msm_pyemma_nonrev = _
%%timeit -r 10 -n 1 -q -o traj = generate_traj(n_steps, n_states)
pyemma.msm.estimate_markov_model(traj, lag=lagtime)
<TimeitResult : 8.24 ms ± 335 µs per loop (mean ± std. dev. of 10 runs, 1 loop each)>
time_msm_pyemma_rev = _
If you have already formatted the trajectory, msmhelper is even significantly faster:
%%timeit -r 10 -n 1 -q -o traj = mh.StateTraj(generate_traj(n_steps, n_states))
traj.estimate_markov_model(lagtime=lagtime)
<TimeitResult : 305 µs ± 30.4 µs per loop (mean ± std. dev. of 10 runs, 1 loop each)>
time_msm_mh_statetraj = _
Comparing to deeptime, we face the issue that it accepts only index (i.e. zero-based) trajectories. Hence, we need to shift the trajectory first.
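For illustration, such a zero-based renaming can be sketched with plain numpy via `np.unique` (a hypothetical stand-in, not necessarily how `mh.utils.rename_by_index` is implemented):

```python
import numpy as np

traj = np.array([3, 1, 4, 1, 5, 9, 2, 6])  # arbitrary (here 1-based) state labels
states, idx_traj = np.unique(traj, return_inverse=True)
print(idx_traj)  # contiguous 0-based indices, same dynamics
# the original labels can be recovered via states[idx_traj]
```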
%%timeit -r 10 -n 1 -q -o traj = generate_traj(n_steps, n_states)
# shift trajectory to index trajectory
traj = mh.utils.rename_by_index(traj)
deeptime.markov.msm.MaximumLikelihoodMSM(
reversible=True,
).fit_fetch(
deeptime.markov.TransitionCountEstimator(
lagtime=lagtime,
count_mode='sliding',
).fit_fetch(traj),
lagtime=lagtime,
)
<TimeitResult : 4.31 ms ± 427 µs per loop (mean ± std. dev. of 10 runs, 1 loop each)>
time_msm_deeptime = _
Comparing the performance, we find:
visualize_benchmark_results(
(time_msm_pyemma_nonrev, time_msm_pyemma_rev, time_msm_mh_traj, time_msm_mh_statetraj, time_msm_deeptime),
('PyEMMA\n non-reversible', 'PyEMMA\n reversible', 'msmhelper\n numpy', 'msmhelper\n StateTraj', 'deeptime\n reversible'),
rf'Benchmarking MSM estimation with $N_\mathrm{{steps}}=10^{np.log10(n_steps):.0f}$ and {n_states} states',
)
Increasing the number of states and steps we find:
# create random trajectory with more states and frames
n_steps = int(1e7)
n_states = 100
%%timeit -r 5 -n 1 -q -o traj = mh.StateTraj(generate_traj(n_steps, n_states))
traj.estimate_markov_model(lagtime=lagtime)
<TimeitResult : 26.9 ms ± 230 µs per loop (mean ± std. dev. of 5 runs, 1 loop each)>
time_msm_mh_statetraj = _
%%timeit -r 5 -n 1 -q -o traj = generate_traj(n_steps, n_states)
mh.msm.estimate_markov_model(traj, lagtime=lagtime)
<TimeitResult : 374 ms ± 4.53 ms per loop (mean ± std. dev. of 5 runs, 1 loop each)>
time_msm_mh_traj = _
%%timeit -r 5 -n 1 -q -o traj = generate_traj(n_steps, n_states)
pyemma.msm.estimate_markov_model(traj, lag=lagtime, reversible=False)
<TimeitResult : 731 ms ± 1.67 ms per loop (mean ± std. dev. of 5 runs, 1 loop each)>
time_msm_pyemma_nonrev = _
%%timeit -r 5 -n 1 -q -o traj = generate_traj(n_steps, n_states)
pyemma.msm.estimate_markov_model(traj, lag=lagtime, reversible=True)
<TimeitResult : 733 ms ± 3.56 ms per loop (mean ± std. dev. of 5 runs, 1 loop each)>
time_msm_pyemma_rev = _
visualize_benchmark_results(
(time_msm_pyemma_nonrev, time_msm_pyemma_rev, time_msm_mh_traj, time_msm_mh_statetraj),
('PyEMMA\n non-reversible', 'PyEMMA\n reversible', 'msmhelper\n numpy', 'msmhelper\n StateTraj'),
rf'Benchmarking MSM estimation with $N_\mathrm{{steps}}=10^{np.log10(n_steps):.0f}$ and {n_states} states',
)
Estimation of Implied Timescales¶
An important property of a Markov state model are its implied timescales. These are obtained from the $i$-th largest eigenvalue $\lambda_i$ of the transition matrix $T_{ij}$ and are defined by $$t_i = - \frac{t_\text{lag}}{\log(\lambda_i)}$$
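As a small sketch of this formula, the implied timescales of a toy transition matrix can be computed directly with numpy (matrix and lagtime are illustrative values):

```python
import numpy as np

# toy 2-state transition matrix and lagtime (illustrative values)
T = np.array([[0.9, 0.1],
              [0.2, 0.8]])
lagtime = 10

# sort eigenvalues by magnitude; the largest is the stationary eigenvalue 1
eigvals = np.sort(np.abs(np.linalg.eigvals(T)))[::-1]
# implied timescales t_i = -lagtime / log(lambda_i), skipping lambda_0 = 1
timescales = -lagtime / np.log(eigvals[1:])
print(timescales)
```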
# create random trajectory
n_steps = int(1e5)
n_states = 10
n_timescales = 2
# creating lagtimes
lagtimes = np.unique(np.geomspace(1, 100, 20).astype(int))
print(f'lagtimes: {", ".join(lagtimes.astype(str))}')
lagtimes: 1, 2, 3, 4, 5, 6, 8, 11, 14, 18, 23, 29, 37, 48, 61, 78, 100
# catch warnings, because the random state trajectory has mainly complex eigenvalues
import warnings
%%timeit -r 5 -n 1 -q -o traj = generate_traj(n_steps, n_states)
with warnings.catch_warnings():
warnings.simplefilter('ignore')
mh.msm.implied_timescales(traj, lagtimes, ntimescales=n_timescales)
<TimeitResult : 11.6 ms ± 1.26 ms per loop (mean ± std. dev. of 5 runs, 1 loop each)>
time_impl_mh = _
%%timeit -r 5 -n 1 -q -o traj = generate_traj(n_steps, n_states)
with warnings.catch_warnings():
warnings.simplefilter('ignore')
pyemma.msm.its(
traj,
lagtimes,
nits=n_timescales,
show_progress=False,
reversible=False,
n_jobs=1,  # setting this to None does not work with PyEMMA 2.5.12
).timescales
<TimeitResult : 199 ms ± 65.4 ms per loop (mean ± std. dev. of 5 runs, 1 loop each)>
time_impl_pyemma = _
visualize_benchmark_results(
(time_impl_pyemma, time_impl_mh),
('PyEMMA', 'msmhelper'),
'Benchmarking Implied Timescales',
)
Chapman-Kolmogorov Test¶
The most important test of the Markovianity of an MSM is the Chapman-Kolmogorov test, which visualizes the agreement with the Chapman-Kolmogorov equation $$T(n\tau) = T^n(\tau)$$ The following comparison is difficult to interpret, because the packages do not compute exactly the same quantity. However, it should be sufficient to get a feeling for the relative performance.
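The equation itself can be illustrated with a toy example: sample a trajectory from a known transition matrix, estimate $T$ directly at lagtime $n\tau$, and compare it with the $n$-th matrix power of the estimate at $\tau$. This is a sketch with a simple non-reversible estimator, not the test performed by either package:

```python
import numpy as np

rng = np.random.default_rng(42)


def sample_chain(T, n_steps, rng):
    """Sample a state trajectory from transition matrix T."""
    traj = np.empty(n_steps, dtype=int)
    traj[0] = 0
    for t in range(1, n_steps):
        traj[t] = rng.choice(len(T), p=T[traj[t - 1]])
    return traj


def estimate(traj, lag):
    """Non-reversible sliding-window transition-matrix estimate."""
    n = traj.max() + 1
    counts = np.zeros((n, n))
    np.add.at(counts, (traj[:-lag], traj[lag:]), 1)
    return counts / counts.sum(axis=1, keepdims=True)


T_true = np.array([[0.9, 0.1],
                   [0.2, 0.8]])
traj = sample_chain(T_true, 50_000, rng)

tau, n = 5, 4
lhs = estimate(traj, n * tau)                         # T(n*tau)
rhs = np.linalg.matrix_power(estimate(traj, tau), n)  # T(tau)^n
print(np.abs(lhs - rhs).max())  # small for a Markovian process
```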
%%timeit -r 5 -n 1 -q -o traj = generate_traj(n_steps, n_states)
mh.msm.ck_test(traj, lagtimes, tmax=1000)
<TimeitResult : 38.6 ms ± 260 µs per loop (mean ± std. dev. of 5 runs, 1 loop each)>
time_cktest_mh = _
%%timeit -r 5 -n 1 -q -o traj = generate_traj(n_steps, n_states)
for lagtime in lagtimes:
msm = pyemma.msm.estimate_markov_model(traj, lag=lagtime)
msm.cktest(2, n_jobs=1, show_progress=False)
<TimeitResult : 2.01 s ± 26.4 ms per loop (mean ± std. dev. of 5 runs, 1 loop each)>
time_cktest_pyemma = _
%%timeit -r 5 -n 1 -q -o traj = generate_traj(n_steps, n_states)
traj = mh.utils.rename_by_index(traj)
models = [
deeptime.markov.msm.MaximumLikelihoodMSM(
reversible=True,
).fit_fetch(
deeptime.markov.TransitionCountEstimator(
lagtime=lagtime,
count_mode='sliding',
).fit_fetch(traj),
lagtime=lagtime,
)
for lagtime in lagtimes
]
models[0].ck_test(models, n_metastable_sets=n_states)
<TimeitResult : 1.03 s ± 30.4 ms per loop (mean ± std. dev. of 5 runs, 1 loop each)>
time_cktest_deeptime = _
visualize_benchmark_results(
(time_cktest_pyemma, time_cktest_deeptime, time_cktest_mh),
('PyEMMA', 'deeptime', 'msmhelper'),
'Benchmarking Chapman-Kolmogorov Test',
)
Propagating a Markov Chain Monte Carlo¶
We now consider the propagation of a Markov chain Monte Carlo (MCMC), since this plays a central role in msmhelper for estimating timescales.
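As a sketch of the underlying technique, a Markov chain can be propagated from a transition matrix by inverting each row's cumulative distribution. This is illustrative code, not the optimized implementation used by any of the packages:

```python
import numpy as np


def propagate_mcmc(T, n_steps, start=0, seed=0):
    """Draw a Markov chain trajectory from transition matrix T."""
    rng = np.random.default_rng(seed)
    cdf = np.cumsum(T, axis=1)  # per-state cumulative distributions
    traj = np.empty(n_steps, dtype=int)
    traj[0] = start
    # draw all random numbers at once; invert the CDF with searchsorted
    rand = rng.random(n_steps)
    for t in range(1, n_steps):
        traj[t] = np.searchsorted(cdf[traj[t - 1]], rand[t])
    return traj


T = np.array([[0.9, 0.1],
              [0.2, 0.8]])
mcmc = propagate_mcmc(T, n_steps=10_000)
print(np.bincount(mcmc) / len(mcmc))  # ≈ stationary distribution (2/3, 1/3)
```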
# decrease number of steps to speed up msm estimation
n_steps = int(1e5)
n_states = 10
# number to propagate MCMC
n_mcmc_steps = int(1e5)
%%timeit -r 5 -n 1 -q -o traj = mh.StateTraj(generate_traj(n_steps, n_states))
mcmc = mh.msm.timescales.propagate_MCMC(traj, lagtime, n_mcmc_steps)
<TimeitResult : 5.41 ms ± 63.7 µs per loop (mean ± std. dev. of 5 runs, 1 loop each)>
time_mcmc_mh = _
%%timeit -r 5 -n 1 -q -o traj = generate_traj(n_steps, n_states)
mcmc = pyemma.msm.estimate_markov_model(traj, lag=lagtime).generate_traj(n_mcmc_steps)
<TimeitResult : 457 ms ± 3.66 ms per loop (mean ± std. dev. of 5 runs, 1 loop each)>
time_mcmc_pyemma = _
%%timeit -r 5 -n 1 -q -o traj = generate_traj(n_steps, n_states)
traj = mh.utils.rename_by_index(traj)
mcmc = deeptime.markov.msm.MaximumLikelihoodMSM(
reversible=True,
).fit_fetch(
deeptime.markov.TransitionCountEstimator(
lagtime=lagtime,
count_mode='sliding',
).fit_fetch(traj),
lagtime=lagtime,
).simulate(n_mcmc_steps)
<TimeitResult : 17.3 ms ± 206 µs per loop (mean ± std. dev. of 5 runs, 1 loop each)>
time_mcmc_deeptime = _
visualize_benchmark_results(
(time_mcmc_pyemma, time_mcmc_deeptime, time_mcmc_mh),
('PyEMMA', 'deeptime', 'msmhelper'),
'Benchmarking Markov Chain Monte Carlo Propagation',
)