htmd.adaptive.adaptivegoal module#

class htmd.adaptive.adaptivegoal.AdaptiveGoal#

Bases: AdaptiveMD

Adaptive class which uses a Markov state model for respawning

AdaptiveMD uses Markov state models to choose respawning poses for the next epochs. In more detail, it projects all currently retrieved simulations according to the specified projection, clusters those and then builds a Markov model using the discretized trajectories. From the Markov model it then chooses conformations from the various states based on the chosen criteria which will be used for starting new simulations.

Parameters:
  • app (SimQueue object, default=None) – A SimQueue class object used to retrieve and submit simulations

  • project (str, default='adaptive') – The name of the project

  • nmin (int, default=0) – Minimum number of running simulations

  • nmax (int, default=1) – Maximum number of running simulations

  • nepochs (int, default=1000) – Stop adaptive once we have reached this number of epochs

  • nframes (int, default=0) – Stop adaptive once we have simulated this number of aggregate simulation frames.

  • inputpath (str, default='input') – The directory used to store input folders

  • generatorspath (str, default='generators') – The directory containing the generators

  • dryrun (boolean, default=False) – A dry run means that the adaptive will retrieve and generate a new epoch but not submit the simulations

  • updateperiod (float, default=0) – When set to a value other than 0, the adaptive will run synchronously every updateperiod seconds

  • coorname (str, default='input.coor') – Name of the file containing the starting coordinates for the new simulations

  • lock (bool, default=False) – Lock the folder while adaptive is ongoing

  • datapath (str, default='data') – The directory in which the completed simulations are stored

  • filter (bool, default=True) – Enable or disable filtering of trajectories.

  • filtersel (str, default='not water') – Atom selection string for filtering. See more here

  • filteredpath (str, default='filtered') – The directory in which the filtered simulations will be stored

  • projection (Projection object, default=None) – A Projection class object or a list of objects which will be used to project the simulation data before constructing a Markov model

  • truncation (str, default=None) – Method for truncating the prob distribution (None, ‘cumsum’, ‘statecut’

  • statetype (('micro', 'cluster', 'macro'), str, default='micro') – What states (cluster, micro, macro) to use for calculations.

  • macronum (int, default=8) – The number of macrostates to produce

  • skip (int, default=1) – Allows skipping of simulation frames to reduce data. i.e. skip=3 will only keep every third frame

  • lag (int, default=1) – The lagtime used to create the Markov model. Units are in frames.

  • clustmethod (ClusterMixin class, default=<class ‘htmd.clustering.kcenters.KCenter’>) – Clustering algorithm used to cluster the contacts or distances

  • method (str, default='1/Mc') – Criteria used for choosing from which state to respawn from

  • ticalag (int, default=20) – Lagtime to use for TICA in frames. When using skip remember to change this accordinly.

  • ticadim (int, default=3) – Number of TICA dimensions to use. When set to 0 it disables TICA

  • contactsym (str, default=None) – Contact symmetry

  • save (bool, default=False) – Save the model generated

  • goalfunction (function, default=None) – This function will be used to convert the goal-projected simulation data to a ranking whichcan be used for the directed component of FAST.

  • ucscale (float, default=0.5) – Scaling factor for undirected component. Directed component scaling automatically calculated as (1-uscale)

  • nosampledc (bool, default=False) – Spawn only from top DC conformations without sampling

  • autoscale (bool, default=False) – Automatically scales exploration and exploitation ratios depending on how stuck the adaptive is at a given goal score.

Example

>>> crystalSS = MetricSecondaryStructure().project(Molecule('crystal.pdb'))[0]
>>>
>>> # First argument of a goal function always has to be a Molecule object
>>> def ssGoal(mol):
>>>     proj = MetricSecondaryStructure().project(mol)
>>>     ss_score = np.sum(proj == crystalSS, axis=1) / proj.shape[1]  # How many predicted SS match
>>>     return ss_score
>>>
>>> ag = AdaptiveGoal()
>>> ag.generatorspath = '../generators/'
>>> ag.nmin = 2
>>> ag.nmax = 3
>>> ag.projection = [MetricDistance('name CA', 'resname MOL', periodic='selections'), MetricDihedral()]
>>> ag.goalfunction = ssGoal
>>> ag.app = LocalGPUQueue()
>>> ag.run()
>>>
>>> # Or alternatively if we have a multi-argument goal function
>>> def ssGoalAlt(mol, ss):
>>>     proj = MetricSecondaryStructure().project(mol)
>>>     ss_score = np.sum(proj == ss, axis=1) / proj.shape[1]
>>>     return ss_score
>>> from joblib import delayed
>>> ag.goalfunction = delayed(ssGoalAlt)(crystalSS)
>>> ag.app = LocalGPUQueue()
>>> ag.run()