Standard Convolutional Models#

Models that use standard (non-dilated) 1D convolutions with BasicEncodingMixin for simpler inference.

Series2Vec#

Temporal encoding via SoftDTW-based contrastive loss.

class chronocratic.models.convolutional.standard.series2vec.model.Series2Vec(input_dims: int, embedding_dims: int, num_heads: int, feedforward_dims: int, representation_dims: int, dropout_rate: float, encoder_kernel_size: int = 8, learning_rate: float = 0.001, soft_dtw_gamma: float = 0.1, *, sync_dist: bool = False, optimizer_name: str = 'RAdam', weight_decay: float = 0.0)#

Bases: LightningModule, BasicEncodingMixin

Lightning wrapper for Series2Vec pretraining.

The public input shape is (batch, time, channels).

This model was implemented based on the code available on this GitHub repo Navidfoumani/Series2Vec.

forward(x: Tensor) → Tensor#: Return Series2Vec representations for x.

property encoder: Module#: Return the Series2Vec network for inspection and checkpointing.

training_step(batch: Tensor, _batch_idx: int) → Tensor#: Compute and log the Series2Vec pretraining loss for one batch.

validation_step(batch: Tensor, _batch_idx: int) → Tensor#: Compute and log the Series2Vec validation loss for one batch.

configure_optimizers() → Optimizer#: Return the configured optimizer for Series2Vec pretraining.

property representation_dim: int#

Flattened representation size (temporal + frequency concatenated).

Returns:: 2 * representation_dims — the output dimension of Series2VecNetwork.encode().

Configuration for the Series2Vec model.

Provides Series2VecModelParameters with all settings for the dual time/frequency Series2Vec encoder, soft-DTW target computation, and optimizer choice.

class chronocratic.models.convolutional.standard.series2vec.config.Series2VecModelParameters(*, input_dims: int, embedding_dims: int, num_heads: int, feedforward_dims: int, representation_dims: int, dropout_rate: float, encoder_kernel_size: int = 8, learning_rate: float = 0.001, soft_dtw_gamma: float = 0.1, sync_dist: bool = False, optimizer_name: Literal['Adam', 'RAdam', 'AdamW'] = 'RAdam', weight_decay: float = 0.0)#

Bases: object

Configuration for the Series2Vec model.

Parameters:

input_dims – Number of input features (channels) in the time series.
embedding_dims – Token embedding dimensionality.
num_heads – Number of attention heads in the transformer encoder.
feedforward_dims – Hidden dimensionality of the transformer feed-forward block.
representation_dims – Output dimensionality of the projection head used for pretraining.
dropout_rate – Dropout probability applied throughout the network.
encoder_kernel_size – Kernel size of the convolutional tokenizer.
learning_rate – Base learning rate for the optimizer.
soft_dtw_gamma – Smoothing parameter for the soft-DTW distance used as the temporal target.
sync_dist – Whether to synchronize logged metrics across distributed processes.
optimizer_name – Optimizer to use; one of 'Adam', 'RAdam', or 'AdamW'.
weight_decay – L2 weight-decay coefficient passed to the optimizer.

TSTCC#

Temporal contrastive clustering for representation learning.

class chronocratic.models.convolutional.standard.tstcc.model.TSTCC(input_channels: int, kernel_size: int, stride: int, final_out_channels: int, features_len: int, num_classes: int, dropout: float = 0.35, tc_hidden_dim: int = 100, tc_timesteps: int = 6, temperature: float = 0.2, *, use_cosine_similarity: bool = True, learning_rate: float = 0.0003, lambda1: float = 1.0, lambda2: float = 0.7, sync_dist: bool = False, augmentation: AugmentationProducer[ViewPair] | None = None)#

Bases: LightningModule, BasicEncodingMixin

PyTorch Lightning module for TS-TCC (self-supervised pretraining only).

Single-purpose model for temporal + contextual contrastive pre-training on augmented views. Labels are ignored during pretraining.

Batch format: (data, labels) where labels is ignored. Two augmented views of data are produced by the injected AugmentationProducer[ViewPair] (e.g. _default_tstcc_pair()), which provides Gaussian scaling (weak) and segment-permutation + jitter (strong) views.

Uses automatic_optimization = False because two separate optimizers (one per sub-module) must be stepped independently.

For downstream classification or regression, use SupervisedModule from chronocratic.models.supervised.

This model was implemented based on the code available on this GitHub repo emadeldeen24/TS-TCC under MIT License.

forward(x: Tensor) → tuple[Tensor, Tensor]#: Run the encoder. Returns (logits, features).

training_step(batch: tuple[Tensor, Tensor], _batch_idx: int) → Tensor#: Manual optimization step for both sub-module optimizers.

validation_step(batch: tuple[Tensor, Tensor], _batch_idx: int) → Tensor#: Compute and log validation loss.

configure_optimizers() → OptimizerLRScheduler#: Return one Adam optimizer per sub-module (encoder and TC model).

property encoder: Module#: Return the TCC encoder for inspection and checkpointing.

property representation_dim: int#

Flattened pre-logits feature size of the TCC encoder.

Returns:: The input dimension of the encoder’s logits nn.Linear layer, which equals final_out_channels * features_len.

Configuration for the TS-TCC model.

Provides TSTCCModelParameters with all settings for the TCC encoder, temporal contrast head, and NT-Xent contextual loss for self-supervised pretraining.

class chronocratic.models.convolutional.standard.tstcc.config.TSTCCModelParameters(*, input_channels: int, kernel_size: int, stride: int, final_out_channels: int, features_len: int, num_classes: int, dropout: float = 0.35, tc_hidden_dim: int = 100, tc_timesteps: int = 6, temperature: float = 0.2, use_cosine_similarity: bool = True, learning_rate: float = 0.0003, lambda1: float = 1.0, lambda2: float = 0.7, sync_dist: bool = False)#

Bases: object

Configuration for the TS-TCC model.

Parameters:

input_channels – Number of input features (channels) in the time series.
kernel_size – Convolutional kernel size used in the TCC encoder.
stride – Convolutional stride used in the TCC encoder.
final_out_channels – Number of channels produced by the final encoder block (also used as the temporal-contrast input dim).
features_len – Length of the encoder feature map fed into the logits head.
num_classes – Number of output classes for the encoder logits head.
dropout – Dropout probability applied inside the TCC encoder.
tc_hidden_dim – Hidden dimensionality of the temporal-contrast module.
tc_timesteps – Number of future timesteps the temporal-contrast module predicts.
temperature – Temperature scaling for the NT-Xent contextual loss.
use_cosine_similarity – Whether the NT-Xent loss uses cosine similarity (otherwise dot-product).
learning_rate – Base learning rate for the two Adam optimizers (encoder and temporal-contrast).
lambda1 – Weight of the temporal-contrast loss term in the self-supervised objective.
lambda2 – Weight of the contextual NT-Xent loss term in the self-supervised objective.
sync_dist – Whether to synchronize logged metrics across distributed processes.

FCN (MCL)#

Multi-scale contrastive learning with a minimal FCN architecture.

class chronocratic.models.convolutional.standard.mcl.model.FCN(n_in: int, output_dims: int = 320, alpha: float = 1.0, learning_rate: float = 0.001)#

Bases: LightningModule, BasicEncodingMixin

FCN encoder for Mixup Contrastive Learning (MCL).

This model was implemented based on the code available on this GitHub repo Wickstrom/MixupContrastiveLearning.

property encoder: Module#: Return the FCN encoder.

forward(x: Tensor) → Tensor#: Return projected MCL representations for x.

training_step(batch: Tensor, _batch_idx: int) → Tensor#: Compute and log the training loss for one batch.

validation_step(batch: Tensor, _batch_idx: int) → Tensor#: Compute and log the validation loss for one batch.

configure_optimizers() → Optimizer#: Return the Adam optimizer used to train MCL.

Configuration for the MCL (MixUp Contrastive Learning) model.

Provides MCLModelParameters with MCL-specific settings for the FCN encoder and MixUp contrastive criterion.

class chronocratic.models.convolutional.standard.mcl.config.MCLModelParameters(*, n_in: int, output_dims: int = 320, alpha: float = 1.0, learning_rate: float = 0.001)#

Bases: object

Configuration for the MCL model.

Parameters:

n_in – Number of input features (channels) in the time series.
output_dims – Number of output features produced by the encoder.
alpha – Beta-distribution parameter controlling the MixUp interpolation coefficient.
learning_rate – Base learning rate for the Adam optimizer.