alibi.models.pytorch.actor_critic module

This module contains the Pytorch implementation of actor-critic networks used in the Counterfactual with Reinforcement Learning for both data modalities. The models’ architectures follow the standard actor-critic design and can have broader use-cases.

class alibi.models.pytorch.actor_critic.Actor(hidden_dim, output_dim)[source]

Bases: Module

Actor network. The network follows the standard actor-critic architecture used in Deep Reinforcement Learning. The model is used in Counterfactual with Reinforcement Learning (CFRL) for both data modalities (images and tabular). The hidden dimension used for the all experiments is 256, which is a common choice in most benchmarks.

__init__(hidden_dim, output_dim)[source]

Constructor.

Parameters:

hidden_dim (int) – Hidden dimension.
output_dim (int) – Output dimension

forward(x)[source]

Forward pass

Parameters:: x (Tensor) – Input tensor.
Return type:: Tensor
Returns:: Continuous action.

class alibi.models.pytorch.actor_critic.Critic(hidden_dim)[source]

Bases: Module

Critic network. The network follows the standard actor-critic architecture used in Deep Reinforcement Learning. The model is used in Counterfactual with Reinforcement Learning (CFRL) for both data modalities (images and tabular). The hidden dimension used for the all experiments is 256, which is a common choice in most benchmarks.

__init__(hidden_dim)[source]

Constructor.

Parameters:: hidden_dim (int) – Hidden dimension.

forward(x)[source]

Forward pass.

Parameters:: x (Tensor) – Input tensor.
Return type:: Tensor
Returns:: Critic value.