
Friday, December 19, 2025

Reinforcement Learning

An important ML training paradigm is Reinforcement Learning (RL). RL models rely on a reward value generated at the end of each training run/episode to update the parameters (weights) of the model. This differs from other ML methods such as Supervised Learning, where the model learns from labelled data/examples. It also differs from the Unsupervised Learning approach, where the model explores inherent features of unlabelled data during the learning phase to identify clusters, etc.

The keras-io examples repository has some RL implementations such as actor_critic, ppo, etc. All of them work solely with the TensorFlow (tf) backend. In keras_io_examples_rl these have been ported to the Torch/PyTorch backend. The typical changes include:

  • Torch imports
  • Use the torch-specific optimizer: torch.optim.Adam
    • deep_q_network_breakout_pytorch() requires gradient clipping, which in torch is done before optimizer.step()
  • Gradient computations in torch (see the sketch after this list)
    • Replace tf GradientTape with torch autograd
    • Disable gradients globally with torch.set_grad_enabled(False)
    • Enable autograd within the specific flows/methods where needed
    • Call loss.backward() and optimizer.step() for backpropagation
  • A few torch-specific tensor & function changes/wrappers
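
As a rough illustration of the GradientTape-to-autograd pattern, here is a minimal sketch of a ported training step; the model, the loss, and train_step are hypothetical stand-ins, not the actual ported code:

    import torch

    model = torch.nn.Linear(4, 2)                    # stand-in network (hypothetical)
    optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

    torch.set_grad_enabled(False)                    # gradients disabled globally

    def train_step(states, targets):
        with torch.enable_grad():                    # re-enable autograd for this flow
            loss = torch.nn.functional.mse_loss(model(states), targets)
            optimizer.zero_grad()
            loss.backward()                          # backpropagation (replaces tape.gradient)
            # gradient clipping goes before optimizer.step(), as in the breakout DQN port
            torch.nn.utils.clip_grad_norm_(model.parameters(), max_norm=1.0)
            optimizer.step()
        return loss.item()

    loss = train_step(torch.randn(8, 4), torch.randn(8, 2))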

The ported PyTorch-compatible files are:


References

  • http://www.derongliu.org/adp/adp-cdrom/Barto1983.pdf
  • https://hal.inria.fr/hal-00840470/document
  • https://link.springer.com/content/pdf/10.1007/BF00992698.pdf
  • https://www.semanticscholar.org/paper/Human-level-control-through-deep-reinforcement-Mnih-Kavukcuoglu/340f48901f72278f6bf78a04ee5b01df208cc508
  • Deep Deterministic Policy Gradient (DDPG), Continuous control with deep reinforcement learning: https://arxiv.org/abs/1509.02971
  • https://gymnasium.farama.org/
  • Reinforcement Learning: An Introduction, Richard S. Sutton and Andrew G. Barto

Wednesday, November 26, 2025

Explainable AI

With the widespread adoption of large Machine Learning (ML) models, there's a real need to understand the workings of these models. Otherwise the model just appears to be a black-box doing its thing, without the end user really knowing the whys/hows behind the model's responses, choices, decisions, etc. Looking inside the model, the white-box approach, while possible is simply not practical for 99.99..9% of users.

Local Interpretable Model-Agnostic Explanations (LIME) & Shapley Additive Explanations (SHAP) are two black-box techniques that help explain the workings of such models. The key idea behind both is:

  • Generate some (synthetic) input data from actual data, with some of the features (such as income, age, etc.) altered at random.
  • Feed the generated input data to the model and use the output to understand the effects of the altered features (one or more/combinations) on the output, and thereby the importance/relevance of those features to the model's outputs.
  • For example, in a loan approval/rejection scenario, altering two features, income level & gender, and testing might reveal that income level has an effect on the decision but gender does not (see the toy sketch after this list).
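
The perturb-and-observe idea can be illustrated with a toy sketch; loan_model, the feature names, and the values are all hypothetical:

    base = {"income": 50000, "age": 35, "gender": "F"}

    def loan_model(applicant):
        # stand-in black-box model: approves on income alone
        return "APPROVED" if applicant["income"] >= 40000 else "REJECTED"

    for feature, alternatives in {"income": [20000, 80000], "gender": ["M", "F"]}.items():
        for value in alternatives:
            probe = dict(base, **{feature: value})  # copy with one feature altered
            print(feature, value, "->", loan_model(probe))

    # Varying income flips the decision; varying gender never does,
    # so income is an important feature here and gender is not.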

With that background, let's look at SHAP for language models that take text as input. Here the features are the words (tokens) that comprise the input string.

For an input like: "Glad to see you"

[Figure: SHAP text classifier]

The features are: "Glad", "to", "see", "you" 

SHAP explains the impact of each word (token) on the output of the model by passing in various altered inputs with words MASKED:
       "* to see you",  "Glad to * you", ... 

TextClassificationTorchShap.py shows how SHAP works with the Text Classification model trained on the Imdb dataset. The code requires shap to be installed:

        pip3 install shap

In terms of its working, it loads up the pre-trained Text Classification model and vocabulary. Next it plugs into the shap library using a custom tokenizer to generate token_ids & offsets for the given input data.
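
Such a tokenizer can be a simple splitter that returns the tokens together with their character offsets. Here is a minimal, assumed sketch (shap expects a dict with input_ids and offset_mapping keys):

    import re

    def custom_tokenizer(s, return_offsets_mapping=True):
        # Split on non-word characters, recording each token's character span.
        pos, input_ids, offset_ranges = 0, [], []
        for m in re.finditer(r"\W", s):
            start, end = m.span(0)
            if start > pos:
                input_ids.append(s[pos:start])
                offset_ranges.append((pos, start))
            pos = end
        if pos != len(s):
            input_ids.append(s[pos:])
            offset_ranges.append((pos, len(s)))
        out = {"input_ids": input_ids}
        if return_offsets_mapping:
            out["offset_mapping"] = offset_ranges
        return out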

    import shap
    from shap import maskers

    masker = maskers.Text(custom_tokenizer, mask_token=SPECIAL_TOKEN_UNK)
    explainer = shap.Explainer(predict, masker=masker)
 

Finally, shap is called with some sample input text, which it perturbs by masking words at random. Shap collects the outputs, which can be used to generate a visual report of the impact of the different words, as seen below.
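
In code, this final step looks roughly as follows (a minimal sketch reusing the explainer built above; shap.plots.text renders the per-word impact report):

    shap_values = explainer([
        "This is a great one to watch.",
        "What a long drawn boring affair to the end credits.",
    ])
    shap.plots.text(shap_values)  # visual report of each word's impact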

The model classifies any given input text as either POSITIVE (score near 1) or NEGATIVE (score near 0). The figure shows the output for two inputs: "This is a great one to watch." & "What a long drawn boring affair to the end credits."

Let's look first at "This is a great one to watch.":

  • There is a base value = 0.539161, which is the model's output for a completely MASKED-out input, i.e. "* * * * * * *"
  • The words "to w..", "This is" move the score up to 0.7
  • In addition, the words "a great" move the score up to 0.996787, the actual output of the model for the complete input text "This is a great one to watch."
  • The model rightly classifies this as POSITIVE with a score of 0.996787 (close to 1)
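
The per-token contributions are additive (the "Additive" in SHAP): the base value plus the summed per-token SHAP values recovers the model's output for the full sentence. With the figure's numbers:

    0.539161 (base value) + 0.457626 (sum of per-token SHAP values) = 0.996787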

Similarly for the text "What a long drawn boring affair to the end credits.":

  • Completely masked base value = 0.539161.
  • The key words in this case are "boring affair to the".
  • The text is rightly classified as NEGATIVE with a score of 0.0280297 (close to 0).

Saturday, November 22, 2025

Text Classification from Scratch using PyTorch

The AI/ML development framework Keras 3.x has in recent times introduced support for Torch & Jax backends in addition to TensorFlow. However, given Keras's TensorFlow legacy, large sections of the code are deeply integrated with TensorFlow.

One such piece of code is text_classification_from_scratch.py from the keras-io/examples project. Without TensorFlow this piece of code simply doesn't run!

Here's text_classification_torch.py, a Torch/PyTorch port of the same code. The bits that needed modification were:

  • Removing all tensorflow-related imports
  • Loading the Imdb text files in "grain" format in place of "tf" format, by passing the appropriate param: 

    keras.utils.text_dataset_from_directory(..., format="grain")

This obviously needed grain to be installed:

    pip3 install grain 
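
A minimal sketch of the grain-format loading step, assuming a Keras version with grain support; the path and the other params are assumptions mirroring the original example:

    import keras

    raw_train_ds = keras.utils.text_dataset_from_directory(
        "aclImdb/train",      # assumed path to the extracted Imdb dataset
        batch_size=32,
        validation_split=0.2,
        subset="training",
        seed=1337,
        format="grain",       # yield a grain dataset instead of a tf.data one
    )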

  • Using torchtext for Vocab/Tokenizer/Vectorizing (see the sketch after this list):

    pip3 install torchtext

  • A few other changes, such as ensuring the max_features constraint is honoured, the text is standardized and padded, and so on.
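
A minimal sketch of the torchtext-based vectorization; max_features, sequence_length, and the sample texts are assumed stand-ins mirroring the Keras example:

    from torchtext.data.utils import get_tokenizer
    from torchtext.vocab import build_vocab_from_iterator

    tokenizer = get_tokenizer("basic_english")
    train_texts = ["a great movie", "a boring affair"]   # stand-in for the Imdb texts

    # Cap the vocabulary at max_features, reserving ids for padding/unknown tokens.
    max_features = 20000
    vocab = build_vocab_from_iterator(
        (tokenizer(t) for t in train_texts),
        max_tokens=max_features,
        specials=["<pad>", "<unk>"],
    )
    vocab.set_default_index(vocab["<unk>"])              # OOV words map to <unk>

    def vectorize(text, sequence_length=500):
        # Tokenize, truncate to sequence_length, and pad to a fixed length.
        ids = vocab(tokenizer(text))[:sequence_length]
        return ids + [vocab["<pad>"]] * (sequence_length - len(ids))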