Estimate an optimal dynamic treatment regime using Interactive Q-learning.
conda install r_test::r-iqlearn