cma.logger.CMADataLogger

class documentation

class CMADataLogger(interfaces.BaseDataLogger):

Constructor: CMADataLogger(name_prefix, modulo, append, expensive_modulo)

data logger for class CMAEvolutionStrategy.

The logger is identified by its name prefix and (over-)writes or reads according data files. Therefore, the logger must be considered as global variable with unpredictable side effects, if two loggers with the same name and on the same working folder are used at the same time.

Examples

import cma

es = cma.CMAEvolutionStrategy(12 * [3], 4)
es.optimize(cma.ff.elli, callback=es.logger.plot)

plots into the current matplotlib figure (or opens one if none exists) via a generic callback. es.optimize already adds by default es.logger.add to its callback list to add data to the logger. This call:

x, es = cma.fmin2(cma.ff.elli, 12 * [3], 4, {'verb_plot': 1})

is very similar, but plots hard-coded into figure number 324:

es = cma.CMAEvolutionStrategy(...)
logger = cma.CMADataLogger().register(es)
while not es.stop():
    ...
    logger.add()  # add can also take an argument

logger.plot() # or a short cut can be used:
cma.plot()  # plot data from logger with default name into new figure

logger2 = cma.CMADataLogger('just_another_filename_prefix').load()
logger2.plot()
logger2.disp()

import cma
from matplotlib.pylab import *
res = cma.fmin(cma.ff.sphere, rand(10), 1e-0)
logger = res[-1]  # the CMADataLogger
logger.load()  # by "default" data are on disk
semilogy(logger.f[:,0], logger.f[:,5])  # plot f versus iteration, see file header
cma.s.figshow()

Details

After loading data, the logger has the attributes xmean, xrecent, std, f, D and corrspec corresponding to xmean, xrecentbest, stddev, fit, axlen and axlencorr filename trails.

See Also
`disp` (), `plot` ()

Method	`__init__`	initialize logging of data from a `CMAEvolutionStrategy` instance.
Method	`add`	append some logging data from `CMAEvolutionStrategy` class instance `es`, if `number_of_times_called % modulo` equals to zero, never if `modulo==0`.
Method	`disp`	displays selected data from (files written by) the class `CMADataLogger`.
Method	`disp_header`	Undocumented
Method	`downsampling`	rude downsampling of a `CMADataLogger` data file by `factor`, keeping also the first `first` entries. This function is a stump and subject to future changes. Return self.
Method	`figclose`	Undocumented
Method	`initialize`	reset logger, overwrite original files, `modulo`: log only every modulo call
Method	`load`	load (or reload) data from output files, `load` is called in `plot` and `disp`.
Method	`plot`	plot data from a `CMADataLogger` using files written by the logger.
Method	`plot_all`	Obsolete: plot data from a `CMADataLogger` (using the files written by the logger).
Method	`plot_axes_scaling`	Undocumented
Method	`plot_correlations`	spectrum of correlation or precision matrix and percentiles of off-diagonal entries
Method	`plot_divers`	plot fitness, sigma, axis ratio...
Method	`plot_estimated_errors`	[WIP] experimental
Method	`plot_mean`	Undocumented
Method	`plot_sensitivities`	[WIP] experimental
Method	`plot_sigvec`	plot (outer) scaling from diagonal decoding.
Method	`plot_stds`	`iabscissa=1` means vs function evaluations, `idx` picks variables to plot
Method	`plot_xrecent`	Undocumented
Method	`popsize`	return median popsize from iteration and evaluation data
Method	`register`	register a `CMAEvolutionStrategy` instance for logging, `append=True` appends to previous data logged under the same name, by default previous data are overwritten.
Method	`save`	data are saved to disk the moment they are added
Method	`save_to`	saves logger data to a different set of files, for `switch=True` also the loggers name prefix is switched to the new value
Method	`select_data`	keep only data of `iteration_indices`
Method	`zip`	write logger data to file `filename [+ ...] + '.tar.gz'`.
Class Variable	`default_prefix`	Undocumented
Instance Variable	`append`	append to previous data
Instance Variable	`counter`	number of calls to `add`
Instance Variable	`data_version`	Undocumented
Instance Variable	`es`	Undocumented
Instance Variable	`expensive_modulo`	log also values that need an eigendecomposition to be generated every `expensive` iteration
Instance Variable	`fighandle`	Undocumented
Instance Variable	`file_names`	used in load, however hard-coded in add, because data must agree with name
Instance Variable	`file_parameters`	Undocumented
Instance Variable	`file_stoppings`	Undocumented
Instance Variable	`key_names`	used in load, however hard-coded in plot
Instance Variable	`last_correlation_spectrum`	Undocumented
Instance Variable	`last_iteration`	Undocumented
Instance Variable	`last_precision_matrix`	Undocumented
Instance Variable	`last_skipped_iteration`	Undocumented
Instance Variable	`modulo`	how often to record data, allows calling `add` without args
Instance Variable	`name_prefix`	Undocumented
Instance Variable	`original_fontsize`	Undocumented
Instance Variable	`parameters`	content of parameters.json2 file as a dict
Instance Variable	`persistent_communication_dict`	Undocumented
Instance Variable	`registered`	Undocumented
Instance Variable	`relative_allowed_time_for_plotting`	Undocumented
Instance Variable	`skip_finalize_plotting`	Undocumented
Instance Variable	`stoppings`	content of stoppings.json2 file as a `str`
Instance Variable	`timer_all`	Undocumented
Instance Variable	`timer_plot`	Undocumented
Instance Variable	`x`	Undocumented
Property	`data`	return dictionary with data.
Property	`mueff`	Undocumented
Method	`_enter_plotting`	assumes that a figure is open
Method	`_finalize_plotting`	Undocumented
Method	`_plot_x`	see def plot(...
Method	`_unzip`	to unzip use "tar -xf filename" in a system shell
Method	`_xlabel`	Undocumented
Instance Variable	`_eigen_counter`	Undocumented
Instance Variable	`_key_names_with_annotation`	used in load to add one data row to be modified in plot
Property	`_sensitivities`	return for each coordinate `len(self.data['f'][:, 0])` sensitivities over time

Inherited from BaseDataLogger:

Instance Variable	`filename`	file to save to or load from unless specified otherwise
Instance Variable	`optim`	object instance to be logging data from
Instance Variable	`_data`	`dict` of logged data

def __init__(self, name_prefix=default_prefix, modulo=1, append=False, expensive_modulo=1): ¶

overrides cma.interfaces.BaseDataLogger.__init__

initialize logging of data from a CMAEvolutionStrategy instance.

Default modulo=1 means logging with each call of add. If append is True data is appended to already existing data of a logger with the same name. Additional eigendecompositions are allowed only every expensive_modulo-th logged iteration.

def add(self, es=None, more_data=(), modulo=None): ¶

overrides cma.interfaces.BaseDataLogger.add

append some logging data from CMAEvolutionStrategy class instance es, if number_of_times_called % modulo equals to zero, never if modulo==0.

more_data is a list of additional data to be recorded where each data entry must have the same length.

When used for a different optimizer class, this function can be (easily?) adapted by changing the assignments under INTERFACE in the implemention.

def disp(self, idx=100): ¶

overrides cma.interfaces.BaseDataLogger.disp

displays selected data from (files written by) the class CMADataLogger.

Arguments

idx

indices corresponding to rows in the data file; if idx is a scalar (int), the first two, then every idx-th, and the last three rows are displayed. Too large index values are removed. If idx=='header', the header line is printed.

Example

>>> import cma, numpy as np
>>> res = cma.fmin(cma.ff.elli, 7 * [0.1], 1, {'verb_disp':1e9})  # generate data
...  #doctest: +ELLIPSIS
(4...
>>> assert res[1] < 1e-9
>>> assert res[2] < 4400
>>> l = cma.CMADataLogger()  # == res[-1], logger with default name, "points to" above data
>>> l.disp([0,-1])  # first and last
...  #doctest: +ELLIPSIS
Iterat Nfevals  function value    axis ratio maxstd  minstd...
>>> l.disp(20)  # some first/last and every 20-th line
...  #doctest: +ELLIPSIS
Iterat Nfevals  function value    axis ratio maxstd  minstd...
>>> l.disp(np.r_[0:999999:100, -1]) # every 100-th and last
...  #doctest: +ELLIPSIS
Iterat Nfevals  function value    axis ratio maxstd  minstd...
>>> l.disp(np.r_[0, -10:0]) # first and ten last
...  #doctest: +ELLIPSIS
Iterat Nfevals  function value    axis ratio maxstd  minstd...
>>> cma.disp(l.name_prefix, np.r_[0:9999999:100, -10:])  # the same as l.disp(...)
...  #doctest: +ELLIPSIS
Iterat Nfevals  function value    axis ratio maxstd  minstd...

Details

The data line with the best f-value is displayed as last line.

Use CMADataLogger.disp if the logger does not have the default name.

See Also
`CMADataLogger.disp`, `CMADataLogger.disp`

def disp_header(self): ¶

Undocumented

def downsampling(self, factor=10, first=3, switch=True, verbose=False): ¶

rude downsampling of a CMADataLogger data file by factor, keeping also the first first entries. This function is a stump and subject to future changes. Return self.

Arguments

factor -- downsampling factor

first -- keep first first entries

switch -- switch the new logger to the downsampled logger

original_name+'down'

Details

self.name_prefix+'down' files are written

Example

import cma
cma.downsampling()  # takes outcma/cma* files
cma.plot('outcma/cmadown')

def figclose(self): ¶

Undocumented

def initialize(self, modulo=None): ¶

reset logger, overwrite original files, modulo: log only every modulo call

def load(self, filenameprefix=None): ¶

overrides cma.interfaces.BaseDataLogger.load

load (or reload) data from output files, load is called in plot and disp.

Argument filenameprefix is the filename prefix of data to be loaded (5-8 files), by default 'outcmaes/' (where / stands for the OS-specific path seperator).

Return self with (added) attributes xrecent, xmean, f, D, std, corrspec, sigvec.

def plot(self, fig=None, iabscissa=0, iteridx=None, plot_mean=False, foffset=1e-19, x_opt=None, fontsize=7, downsample_to=10000000.0, xsemilog=False, xnormalize=False, xtransform=None, fshift=0, addcols=None, load=True, message='', **kwargs): ¶

overrides cma.interfaces.BaseDataLogger.plot

plot data from a CMADataLogger using files written by the logger.

Arguments

fig: figure number, by default starting from 325

iabscissa or abscissa: 0 ==> plot versus iteration count, 1 ==> plot versus function evaluation number

iteridx: the iteration indices to plot, e.g. range(100) for the first 100 evaluations.

plot_mean:bool indicates whether to plot the current best x or the mean.

foffset:float a small value added to f values to improve log-plot appearance.

downsample_to: number of data lines to use at most

x_opt: plot x-values as a difference to another solution and/or select a subset of variables to plot.

If x_opt is a nonnegative int, it is interpreted as iteration number and the difference of x to the respective (closest but not larger) iteration is plotted.

If x_opt is a negative int the respective index rather than the iteration is used. Namely and in particular, x_opt=0 subtracts the initial solution X0 and x_opt=-1 subtracts the final solution of the data.

If len(x_opt) == dimension or x_opt is a float, the difference to x_opt is plotted.

Otherwise: x_opt[0] defines the indices of the variables to be plotted where x_opt[1], if present, is subtracted.

xsemilog:bool: customized semilog plot for x-values

xnormalize:bool: divide x-values by standard deviations, may show stationary: behavior when the optimum is in zero

xtransform:callable is a function that returns transformed x-values, applied before x_opt is subtracted and xnormalize is done.

fshift:float is added to all displayed fitness values

addcols:int is the number of added columns for additional subplots

load:bool=True: load/reload data before to try plotting. Useful as a CMADataLogger instance itself does not load data by default.

message:str: a message text appearing in the plot

Return CMADataLogger itself.

Example

import cma
logger = cma.CMADataLogger()  # with default name
logger.plot()
cma.s.figsave('fig325.png')  # save current figure
logger.figclose()  # or cma.s.figclose()

Details

Data from codes in other languages (C, Java, Matlab, Scilab) have the same format and can be plotted just the same.

Dependencies: matlabplotlib.pyplot

See Also
`cma.plot` alias `cma.logger.plot`

def plot_all(self, fig=None, iabscissa=0, iteridx=None, foffset=1e-19, x_opt=None, fontsize=7): ¶

Obsolete: plot data from a CMADataLogger (using the files written by the logger).

Superseded by CMADataLogger.plot

Arguments

See CMADataLogger.plot

Return CMADataLogger itself.

def plot_axes_scaling(self, iabscissa=0): ¶

Undocumented

def plot_correlations(self, iabscissa=0, name='corrspec'): ¶

spectrum of correlation or precision matrix and percentiles of off-diagonal entries

def plot_divers(self, iabscissa=0, foffset=1e-19, fshift=0, message=None): ¶

plot fitness, sigma, axis ratio...

Parameters
iabscissa	0 means vs evaluations, 1 means vs iterations
foffset	added to f-value after abs(f) is taken
fshift	Undocumented
message	Undocumented
See Also
`plot`

def plot_estimated_errors(self, x_opt=None, markerscaler=10, log_threshold=0.05, dim_exponent=1, multiplier=None, plot_kwargs={}): ¶

[WIP] experimental

multiplier takes dimension and mueff as input and computes the error multiplier. The soon to be expired default value was dim**0.5 * 3 / min((popsize, 3 * dim + 15)).

The trust is defined as -max_i(Delta log(sigma * D_ii)) and is high when all axes diminish.

def plot_mean(self, iabscissa=0, x_opt=None, annotations=None, xsemilog=None, xnormalize=None): ¶

Undocumented

def plot_sensitivities(self, plot_absolute=False, gradf=None): ¶

[WIP] experimental

Plot IQR(f) / IQR(std_i). The donimator is computed as 1.349 x std_i x sqrt(n). The sqrt(n) accounts for the effect of the changes in all other variables on f.

def plot_sigvec(self, iabscissa=0, idx=None): ¶

plot (outer) scaling from diagonal decoding.

iabscissa=1 plots vs function evaluations

idx picks variables to plot if len(idx) < N, otherwise it picks iteration indices (in case, after downsampling).

def plot_stds(self, iabscissa=0, idx=None): ¶

iabscissa=1 means vs function evaluations, idx picks variables to plot

def plot_xrecent(self, iabscissa=0, x_opt=None, annotations=None, xsemilog=None, xnormalize=None): ¶

Undocumented

def popsize(self, data_field='f'): ¶

return median popsize from iteration and evaluation data

def register(self, es, append=None, modulo=None): ¶

overrides cma.interfaces.BaseDataLogger.register

register a CMAEvolutionStrategy instance for logging, append=True appends to previous data logged under the same name, by default previous data are overwritten.

def save(self, name=None): ¶

overrides cma.interfaces.BaseDataLogger.save

data are saved to disk the moment they are added

def save_to(self, nameprefix, switch=False): ¶

saves logger data to a different set of files, for switch=True also the loggers name prefix is switched to the new value

def select_data(self, iteration_indices): ¶

keep only data of iteration_indices

def zip(self, filename=None, unique=True): ¶

write logger data to file filename [+ ...] + '.tar.gz'.

filename defaults to self.name_prefix.

When unique is true, filename is furnished with a unique time stamp.

Return the path (relative or absolute) of the output file.

This function does in essence just tar -czf filename.tar.gz folder where folder defaults to the current plot data folder and filename is created to be unique.

TODO: we may want to be able to add a time stamp to the folder name too, to prevent conficts later?

default_prefix = ¶

Undocumented

append = ¶

append to previous data

counter: int = ¶

number of calls to add

data_version = ¶

Undocumented

es = ¶

Undocumented

expensive_modulo = ¶

log also values that need an eigendecomposition to be generated every expensive iteration

fighandle = ¶

Undocumented

file_names: tuple[str, ...] = ¶

used in load, however hard-coded in add, because data must agree with name

file_parameters: str = ¶

Undocumented

file_stoppings: str = ¶

Undocumented

key_names: tuple[str, ...] = ¶

used in load, however hard-coded in plot

last_correlation_spectrum: dict = ¶

Undocumented

last_iteration = ¶

Undocumented

last_precision_matrix = ¶

Undocumented

last_skipped_iteration = ¶

Undocumented

modulo = ¶

how often to record data, allows calling add without args

name_prefix = ¶

Undocumented

original_fontsize = ¶

Undocumented

parameters = ¶

content of parameters.json2 file as a dict

persistent_communication_dict = ¶

Undocumented

registered: bool = ¶

Undocumented

relative_allowed_time_for_plotting = ¶

Undocumented

skip_finalize_plotting: bool = ¶

Undocumented

stoppings = ¶

content of stoppings.json2 file as a str

timer_all = ¶

Undocumented

timer_plot = ¶

Undocumented

x = ¶

Undocumented

@property
data = ¶

overrides cma.interfaces.BaseDataLogger.data

return dictionary with data.

If data entries are None or incomplete, consider calling .load().data to (re-)load the data from files first.

@property
mueff = ¶

Undocumented

def _enter_plotting(self, fontsize=7): ¶

assumes that a figure is open

def _finalize_plotting(self, subplot_adjust=True): ¶

Undocumented

def _plot_x(self, iabscissa=0, x_opt=None, remark=None, annotations=None, xsemilog=None, xnormalize=False): ¶

see def plot(...

def _unzip(self, filename): ¶

to unzip use "tar -xf filename" in a system shell

and afterwards:

logger = cma.CMADataLogger(extacted_folder_name).plot()

in a Python shell.

def _xlabel(self, iabscissa=0): ¶

Undocumented

_eigen_counter: int = ¶

Undocumented

_key_names_with_annotation: tuple[str, ...] = ¶

used in load to add one data row to be modified in plot

@property
_sensitivities = ¶

return for each coordinate len(self.data['f'][:, 0]) sensitivities over time