Chimeraki
  • Home
  • Scientific Work
  • Visual Art
    • Behind The Lens
    • Canvas Chaotica
  • Movement Art
  • Spilled Ink
  • Travel
  • Contact Me

Science and Research 

Research Interests

Hi I'm Sanjukta Krishnagopal, a UC presidential postdoctoral fellow at UC Berkeley (where I am advised by Jennifer Chayes) and UCLA (where I am advised by Mason Porter). Before this I was at the Gatsby Computational Neuroscience Unit at University College London. I graduated with a physics PhD from University of Maryland (2016-2020) where I was advised by Michelle Girvan and was also a fellow of the CoMBiNe (Computation and Mathematics in Biological Networks) program. My research lies at the intersection of machine learning, network science, data science, and mathematical and computational biology. My interests are varied and interdiscplinary. My research can broadly be classified into
(1) Algorithms and inference on networks: studying spectral and topological properties of higher-order graphs (e.g. hypergraphs or simplicial complexes). Inference on graphs, and dynamics on graphs, applied to modeling social and biological systems.
(2) Predictive medicine: developing data-driven statistics and ML biologically-plausible models for predictive medicine and gene-integrated personalized therapy in a variety of heterogeneous longitudinal diseases, where disease subtypes are characterized by time-evolving interactions. between various different types of variables (data).
(3) Explainable AI and graph neural networks: Developing new machine learning architectures for explainable AI. I do this through using tools from dynamical systems for generalizable learning that exploits manifold structure of the data being trained on without overfitting. I also develop new learning rules (as alternatives to backprop) and training mechanisms for training neural networks, applied to a wide class of neural networks, including RNNs, and GNNs for faster and better learning. Lastly, I apply GNNs and DNNs to capture important features in biological data for disease prediction.
(4) Information spreading and data science: studying how information or relationships propagates in social systems - I study this in the context of opinion and disease spreading (which can be coupled process). I also study the spreading of influence in political systems, to underlying the process of bill-passing in Congress. I'm particularly interested in studying how opinion propagation results in exacerbation of various types of inequalities in society, and how interventions can be adapted in a social network.
(5) Computational neuroscience: understanding how learning and decision making happens in the brain through reinforcement learning and computational tools based on brain-inspired biologically plausible learning. Using computational models, built on brain structure, to derive insight into brain disorders, including mental health problems and disorders that affect memory and learning.

A short description of some recent project themes are found below (updated March 2022)

Current Position: UC Presidential Postdoc 
I'm on the job market! The presidential postdoc comes with a hiring incentive of $85k/year for 5 years to any UC department that hires me, as well as a search and advertising waiver for the position.

sanjukta_cv_march2023.pdf
File Size: 269 kb
File Type: pdf
Download File

Primary Research Themes

Machine Learning
A biologically plausible alternative to backpropagation for learning
How does learning occur? How do weights change? Machine learning's answer to this is backpropagation, however this is not biologically plausible, and networks trained with this rule tend to forget old tasks when learning new ones. We propose a novel learning rule as an alternative to backpropagation, that involves local learning rules only (each layer computes it's own error without backpropagating). Naively, this will likely not work, as it removes the effectiveness of multiple layers, however, we use a gating mechanism that segregates the weights based on input. i.e., if two inputs are close to each other, the set of weights that change have large overlap, and if two inputs are far away, then the set of weights that learn them have very little to no overlap. Dendritic Gated Networks are significantly more data efficient than conventional artificial networks and are highly resistant to forgetting, and we show that they perform well on a variety of tasks, in some cases better than backpropagation. The DGN bears similarities to the cerebellum and validates some experimental results from in vivo mouse cerebellar imaging.
I gave a talk on this work at Cosyne 2021  which had an audience of 1200 people! (and a 5% acceptance rate). This work is done in collaboration with Google Deepmind
Arxiv paper

​ENCODED PRIOR SLICED WASSERSTEIN AUTOENCODER: LEARNING LATENT MANIFOLD REPRESENTATIONS
I am interested in learning latent representations of which preserve the geometry and topology of the data. In a sense, embedding the entire data manifold. Naturally, this should vastly improve interpretability of latent space, and interpolation in it. In VAEs the use of conventional priors can limit their ability to encode the underlying structure of data.
here we introduce EPSWAE, where the latent representation is learned through a different network preserves the topological and geometric properties of the data manifold. The use of the Wasserstein distance (as opposed to the KL divergence) makes this possible. 
A second interest of mine is navigating and interpreting the latent space. If the latent space lies on a low-d manifold, it's natural to interpolate along geodesics such that intermediate points lie on the manifold. I propose a network-geodesics algorithm to do exactly that, as opposed to conventional linear interpolation in latent space.
Arxiv link
interpretable learning: Reservoir computing 
Reservoir computing is a machine learning architecture that is a popular model for foundational study of neural network dynamics - since the reservoir itself is a dynamical system. The connections between the neurons are recurrent, and follow a differential equation that can be solved (numerically) to reveal interesting dynamical states/attractors. Thus the learning is encoded in the dynamical state of the reservoir computer in response to the input, which is a radically different way of thinking of about learning. This property of reservoir computers makes them an ideal candidate for understanding how learning occurs, i.e., not a black box, and learn chaotic models (hence, widely popular in modeling weather patterns). A bonus is that they can be implemented in hardware!

My research  in reservoir computing spans a variety of topics that have resulted in three publications:
Generalization of learning using very little data, 
Understanding how learning occurs through attractors in the reservoir dynamics, 
​
Separating chaotic signals e.g. a chaotic version of the cocktail party problem.

graph neural networks: spectral analysis for improved features
Graph neural networks do many wonderful things with graphical data structures, but are still somewhat of a black box. I am interesting in understanding how the spectrum of the graph laplacian can be harnessed to improve learning in the GNN framework. There are many directions one may explore in this direction, one in particular that I am interested in is exploring the graph neural tangent kernel limit (wide limit), where there exist theoretical guarantees of a convex loss function. I am interested in understanding how growing a graph along directions that maximize eigenvalues may be related to the selection of features, and speed of convergence.

I also use graph neural networks in application such as predictive medicine. See this paper.

Topological Data analysis and machine learning for materials
Here I attempt to use computational tools to predict properties of compounds called metal-organic-frameworks (MOFs). These materials are incredibly useful in gas purification (like removing CO2 from the environment), gas storage (like storing ultracold gases), and even as superconductors. They naturally are incredibly useful in climate change related applications, however are difficult and time consuming to synthesize. By using machine learning and topological data analysis, I attempt to study the structures of existing MOFs, to predict which new ones (new combinations of metals and organic components) will have useful properties. This is ongoing joint work with Jennifer Chayes, Christian Borgs, and Aditi Krishnapriyan. 
Network and data science
Computational biology: personalized medicine and genomics
This theme of research involves computational methods for medical application. This involves applying various techniques from multilayer networks, graph neural networks and statistics to predict disease subtypes in patients early on, and consequently pre-emptively treat them. In particular, I developed the 'Trajectory Clustering' algorithm that identifies disease subtypes in heterogenous multivariate diseases, where conventional methods face two main challenges challenges (1) unable to capture time-evolution of interactions, (2) can't handle multiple types of data (ordinal, categorical, continuous, phenotypic and genetic etc.). I collaborate with several clinicians on Parkinson's and Stroke. Three papers have resulted from this theme
Publications:
Plos One paper.
Biomedical Physics and Engineering Express  paper.
Stroke paper.
​
and Talk at NetSci 2018, Paris  here
Picture
Higher order networks: simplicial complexes
 While graphs are a source of rich information about pairwise interactions, several real world networks involve interactions between more than two agents. For example three students meeting in a break room is a  simultaneous 3-way interaction (represented by a filled traingle), not 3 pairwise interactions. So many real networks are higher-order, and often misleadingly reduced to pairwise interactions. Simplicial complexes are powerful tools to model higher order interactions. The nice thing about them is that they have mathematical definitions and can be analyzed through the lens of topology and geometry. I am interested in developing a theoretical understanding of higher-order networks, particularly properties of the higher order (Hodge) Laplacian. I am also interested in various applications.

My most recent investigation involves spectral community detection in an arbitrary dimensional simplicial complex (any number of interacting nodes). I am also studying concepts such as 'holes' or cavities in higher order networks.
Manuscript: Sanjukta Krishnagopal and Ginestra Bianconi
Picture
The cooperative vs competitive dynamics in extreme mountaineering
I enjoy hiking, and am deeply fascinated by extreme mountaineering, even though I haven't actually brought myself to do anything too dangerous. I decided, as a pet data analysis project, to study the various factors both personal and expeditional (and their interactions) that contribute to success/ various types of failure at extremely high altitudes in the Everest ranges. These factors include oxygen use, age, sex, previous expedition etc. but also length of expedition, ratio of sherpas to paying climbers, number of high camps. This necessitated the use of a multiscale network and regression. This topic and the results are personally exciting! I will be presenting this work at Complex Networks 2021 in Madrid. Check out my two recent papers: this and that .
social dynamics coupled with disease spreading - segregation in society
Social relationships evolve constantly, changing patterns of interactions between individuals. Humans are known to be homophilic, i.e., we like to befriend people that are similar to ourselves. Hence, the social network of individuals changes with time, based on how the opinions of people around them align with their own. Conversely, individuals change their opinions, through interactions with others in their network. Lastly, the evolution of these relationships affects the ways in which contagious diseases spread. In this ongoing work, I develop models of opinion-dynamics coupled with disease spreading processes. I investigate how and when consensus or fragmentation occurs within societies, and attempt to capture the fickle nature of relationships through a changing social network. I attempt to do this through computationally modeling, mathematical support, and topological data analysis. I am particularly interested in understanding how mechanisms of human behavior lead to fragmentation of society and inequalities such as differential healthcare, school systems, economic opportunities etc. My interest is to propose policy interventions that mitigate social inequalities. 

Selected Past Research Projects

generalized similarity learning with limited data
We investigate the ways in which a machine learning architecture known as Reservoir Computing that loosely resembles neural dynamics  learns concepts such as “similar” and “different” and other relationships between image pairs and generalizes these concepts to previously unseen classes of data. We find that the reservoir acts as a nonlinear filter that projects the input into the high dimensional reservoir space, where inputs from the same category cluster together (see figure), allowing for easy generalization to unseen data. Our architecture outperforms conventional pair-based methods such as Siamese Neural Networks.
Advised by Yiannis Aloimonos and Michelle Girvan
Published in Complexity. Manuscript here
Picture
Synchorinzation patterns in fractal networks: from structure to function
​We investigate complex synchronization patterns such as cluster synchronization and partial amplitude death in networks of coupled Stuart–Landau oscillators with fractal (hierarchical) connectivities. The study of fractal or self-similar topology is motivated by the network of neurons in the brain. Our results show that there is a direct correlation between topology and dynamics (see top figure) - hierarchical networks display hierarchical dynamics.
Advised by Prof. Eckehard Schoell, PhD
Published in Philosophical Transactions of the Royal Society.
Manuscript here
Picture
Picture
Steganography and double key protection using chaotic maps
We developed the image encryption algorithm based on the chaotic logistic map and cat map. A secret key is used to determine initial conditions and the input to the chaotic map function. The Lorenz map is then used for successive pixel encryption. To make the cipher more robust against any attack, the secret key is modified after encrypting each pixel of the image using Arnold’s cat map. Decryption follows the exact reverse. Figures on the right show original (top), encrypted (middle) and recovered after decryption (bottom) with minimal loss. The encrypted image is then hidden using a steganography technique that uses a cover image along with the Lorenz map to determine the location of the pixels to be hidden in the cover. Tests on efficiency and key sensitivity show that our double-key method provides secure image encryption and real-time transmission.
Advised by Dr. Bijil Prakash
Published in Proceedings of Fourth International Conference on Soft Computing, 2014
Manuscript here
Picture
Picture
Picture
Wasserstein autoencoder to learn latent representation of data manifold
Variational autoencoders have been very successful generative models for a variety of tasks, but the use of conventional Gaussian or Gaussian mixture priors are limited in their ability to capture topological or geometric properties of data in the latent representation. Here we introduce a new architecture that learns the prior, showing that our model can encode topological properties of the data in the latent space. We use this to generate improved interpolations along the data manifold with image data. 
Picture
Biconvlstm for violence detection in videos
We introduce a Bidirectional Convolutional LSTM architecture for violence detection. The encoding of temporal features in both directions allows for a better video representation. Our method performs comparably with state of the art architectures on benchmarked datasets.
Conference proceedings, Workshop for Objectionable Content and Misinformation at ECCV 2018. Publication here

Picture
Encoding of chaotic dynamics in the weights of a recurrent neural network
We study the dynamical properties of a machine learning model called Reservoir Computing (RC) using a novel mathematical tool called 'directional fiber' in order to gain insight into how information is encoded through learning. The RC is trained to predict the chaotic Lorenz signal - chaotic signals are just that, mathematically chaotic, and hence extremely hard to predict. We find that the reservoir, which is a dynamical system itself, after training, emulates properties of the system it is trained on, i.e., contains a higher dimensional projection of the Lorenz fixed points.
IJCNN 2019 paper

For questions and code please contact me
Link to my github
Chimeraki
Copyright 2018
Powered by Create your own unique website with customizable templates.
  • Home
  • Scientific Work
  • Visual Art
    • Behind The Lens
    • Canvas Chaotica
  • Movement Art
  • Spilled Ink
  • Travel
  • Contact Me