Karush Suri

A list of all the posts and pages found on the site. For you robots out there, an XML version is available for digesting as well.

Pages

About me


Posts

Discrete Stochastic Optimization

5 minute read

Published:

This post will cover stochastic optimization with discrete latent random variables. Unlike continuous random variables, discrete random variables encode data in only a few bits, which allows us to capture relevant information compactly. However, differentiating through discrete variables is challenging. We will look at the challenges posed by discrete stochastic optimization and the reparameterization methods that overcome them.
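As a rough illustration of one such reparameterization method (the Gumbel-softmax trick, a standard approach and not necessarily the exact method covered in the post), here is a minimal numpy sketch. A categorical sample is relaxed into a differentiable softmax over Gumbel-perturbed logits:

```python
import numpy as np

def gumbel_softmax(logits, tau=1.0, rng=None):
    """Relaxed categorical sample via the Gumbel-softmax trick."""
    rng = np.random.default_rng() if rng is None else rng
    # Gumbel noise: g = -log(-log(u)), u ~ Uniform(0, 1)
    u = rng.uniform(1e-10, 1.0, size=logits.shape)
    g = -np.log(-np.log(u))
    # Temperature-scaled softmax; as tau -> 0 the sample approaches one-hot
    y = (logits + g) / tau
    y = np.exp(y - y.max())
    return y / y.sum()

# Example: draw a relaxed sample from a 3-way categorical distribution
logits = np.log(np.array([0.7, 0.2, 0.1]))
sample = gumbel_softmax(logits, tau=0.5)
```

Because the output is a smooth function of `logits`, gradients can flow through the sampling step, which is exactly what plain categorical sampling prevents.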

Coarsening Graphs with Neural Networks

11 minute read

Published:

With the rise of large-scale graphs for relational learning, graph coarsening emerges as a computationally viable alternative. We revisit the principles that aim to improve data-driven graph coarsening with adjustable coarsened structures.
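As a small sketch of the basic operation behind graph coarsening (illustrative only; the post's data-driven method is more involved), nodes can be grouped by a cluster assignment into a partition matrix P, giving the coarsened adjacency A_c = Pᵀ A P:

```python
import numpy as np

def coarsen(A, assignment, k):
    """Coarsen adjacency A by merging nodes with the same cluster id."""
    n = len(assignment)
    # Partition matrix P (n x k): P[i, c] = 1 if node i maps to supernode c
    P = np.zeros((n, k))
    P[np.arange(n), assignment] = 1.0
    # Coarsened adjacency: entry (c, d) sums all edges between clusters c and d
    return P.T @ A @ P

# Example: a 4-node path graph collapsed into 2 supernodes
A = np.array([[0, 1, 0, 0],
              [1, 0, 1, 0],
              [0, 1, 0, 1],
              [0, 0, 1, 0]], dtype=float)
A_c = coarsen(A, [0, 0, 1, 1], k=2)
```

Learning-based approaches essentially replace the hard assignment above with an adjustable (often soft) P optimized from data.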

Variational Generalization Bounds

8 minute read

Published:

Recent advancements in generalization bounds have led to the development of tight information-theoretic and data-dependent measures. Although generalization bounds reduce bias in estimates, they often suffer from intractability during empirical evaluation. The lack of a uniform criterion for estimation of Mutual Information (MI) and selection of divergence measures in conventional bounds hinders their utility for sparse distributions. To that end, we revisit generalization through the lens of variational bounds. We identify hindrances based on bias, variance and learning dynamics which prevent accurate approximations of data distributions. Our empirical evaluation, carried out on large-scale unsupervised visual recognition tasks, highlights the necessity for variational bounds as generalization objectives for learning complex data distributions. Approximated estimates demonstrate low variance and improved convergence in comparison to conventional generalization bounds. Lastly, based on observed hindrances, we propose a theoretical alternative which aims to improve learning and tightness of variational generalization bounds. The proposed approach is motivated by contraction theory and yields a lower bound on MI.
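To make the MI-estimation side concrete, here is a minimal sketch of one classical variational lower bound, the Donsker-Varadhan bound I(X;Y) ≥ E_p(x,y)[T] − log E_p(x)p(y)[e^T], evaluated from critic scores. This is a standard bound used for illustration; it is not necessarily the specific bound the post proposes:

```python
import numpy as np

def dv_lower_bound(joint_scores, marginal_scores):
    """Donsker-Varadhan MI lower bound from critic scores T.

    joint_scores: T(x, y) on samples from the joint p(x, y)
    marginal_scores: T(x, y) on samples from the product p(x)p(y)
    """
    return joint_scores.mean() - np.log(np.exp(marginal_scores).mean())
```

In practice T is a learned neural critic trained to maximize this quantity; the excerpt's concerns about bias and variance arise because the log-mean-exp term is estimated from finite samples.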

Evolution-based Soft Actor Critic

6 minute read

Published:

Concepts and applications of Reinforcement Learning (RL) have seen tremendous growth over the past decade. These consist of applications in arcade games, board games and, lately, robotic control tasks. The primary reason for this growth is the usage of computationally efficient function approximators such as neural networks. Modern-day RL algorithms make use of parallelization to reduce training times and boost the agent's performance through effective exploration, giving rise to scalable methods commonly referred to as Scalable Reinforcement Learning (SRL). However, a number of open problems, such as approximation bias, lack of scalability in the case of long time horizons and lack of diverse exploration, restrict the application of SRL to complex control and robotic tasks.

Stacked Capsule Autoencoders

4 minute read

Published:

Capsule Networks (CapsNet) have been growing in application and development ever since the radical breakthrough of Vector Capsules1. The fundamental idea of collecting more information about the presence of an object in an image, such as its pose, angle and depth, remains a non-trivial open problem. CapsNet is a step forward in this direction and addresses the issue by spatially abstracting and taking into account rotational equivariance. This post is a review of the recent direction proposed in the new Stacked Capsule Autoencoder paper2.

Benchmarking Policy Search using CyclicMDP

5 minute read

Published:

Policy search is a crucial aspect of Reinforcement Learning as it directly relates to the optimization of the algorithm in weight space. Various environments are used for benchmarking policy search, including the famous ATARI 2600 games and the MuJoCo control suite. However, many environments have long horizons which force the agent to perform well over many consecutive timesteps. This is often a non-trivial problem when dealing with policy-based approaches. This post introduces a new environment called the CyclicMDP, a long-horizon discrete problem presented to the agent. The environment is competitive with modern-day RL benchmarks in the sense that it presents the agent with a simple problem but at a very long (ideally infinite) horizon.
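A minimal environment along these lines might look like the sketch below. The interface and reward scheme here are purely illustrative assumptions, not the post's exact specification of CyclicMDP:

```python
class CyclicMDP:
    """Sketch of a cyclic MDP: states form a ring, and the agent is
    rewarded for the action that advances along the cycle. The reward
    values and action encoding are illustrative assumptions."""

    def __init__(self, n_states=3):
        self.n = n_states
        self.state = 0

    def step(self, action):
        # action 1 advances one step around the ring (rewarded);
        # any other action stays put (penalized)
        reward = 1.0 if action == 1 else -1.0
        if action == 1:
            self.state = (self.state + 1) % self.n
        return self.state, reward

# Example: the state wraps around, so the horizon is effectively infinite
env = CyclicMDP(n_states=3)
```

Because the state cycles forever, an agent must sustain the correct behaviour indefinitely, which is exactly the long-horizon stress test described above.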

DQN with Atari in 6 Minutes

6 minute read

Published:

This post will introduce Q-Learning, one of the most famous algorithms in Reinforcement Learning, whose deep variant (DQN) utilizes Neural Networks as function approximators. All tutorials in the Reinforcement Learning section require prior knowledge of Deep Neural Networks, and it is recommended that you have a look at the notebooks in the Deep Learning section.
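At its core, Q-Learning repeatedly moves each Q-value toward the temporal-difference target r + γ·max_a' Q(s', a'); DQN replaces the table with a neural network trained on the same target. A tabular sketch (illustrative, not the post's DQN code):

```python
import numpy as np

def q_update(Q, s, a, r, s_next, alpha=0.1, gamma=0.99, done=False):
    """One tabular Q-Learning update on Q-table Q (states x actions)."""
    # TD target bootstraps from the greedy value of the next state;
    # terminal transitions use the raw reward only
    target = r if done else r + gamma * np.max(Q[s_next])
    Q[s, a] += alpha * (target - Q[s, a])
    return Q

# Example: a single update on a 2-state, 2-action table
Q = np.zeros((2, 2))
q_update(Q, s=0, a=0, r=1.0, s_next=1)
```

DQN's key additions on top of this update, experience replay and a target network, exist to stabilize exactly this bootstrapped target when Q is a neural network.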

Capsule Networks for Digit Recognition in PyTorch

8 minute read

Published:

We are going to implement the Capsule Network from the recent paper on Dynamic Routing Between Capsules. The network consists of the novel Primary and Digit Caps layers which perform nested convolutions. Dynamic routing, or more specifically Routing by Agreement, takes place in the Digit Caps layer, which we will discuss in more detail. So let's get started and begin with our standard imports from the PyTorch module.
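One small, self-contained piece of that paper is the squash nonlinearity applied to capsule outputs, which shrinks short vectors toward zero and caps long vectors just below unit length while preserving direction. A numpy sketch (the full tutorial uses PyTorch):

```python
import numpy as np

def squash(s, axis=-1, eps=1e-8):
    """Squash nonlinearity from 'Dynamic Routing Between Capsules':
    v = (|s|^2 / (1 + |s|^2)) * (s / |s|)."""
    sq_norm = np.sum(s ** 2, axis=axis, keepdims=True)
    # Scale factor is < 1 and approaches 1 as |s| grows, so the output
    # keeps the input's direction but has norm strictly below 1
    return (sq_norm / (1.0 + sq_norm)) * s / np.sqrt(sq_norm + eps)

# Example: a capsule vector of length 5 is squashed to length 25/26
v = squash(np.array([3.0, 4.0]))
```

The output norm is then interpreted as the probability that the entity the capsule represents is present, which is why it must lie in [0, 1).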

portfolio

publications

talks

teaching