I currently serve as a Faculty Chair of Jay Chaudhry Software Innovation Centre at Indian Institute of Technology (BHU) Varanasi and also an associated faculty member in the Department of Computer Science and Engineering.
My PhD research deals with the quantum-inspired information retrieval framework with a focus on user-oriented IR, where I investigated how users' information needs can be ascertained during their search behaviour using Information Foraging Theory. My research spans across formal information retrieval (IR) models, in particular, the mathematical formalism(s) behind Quantum Theory; Quantum Probability, Hilbert space and their usage in the study of contextual information retrieval. Besides information retrieval, I am also interested in the synergy between machine learning/deep learning and quantum probabilistic frameworks, and their applications in finance and healthcare sector.
I began my studies in Computer Science at the Chhatrapati Shahu Ji Maharaj University (formerly Kanpur University) in India, culminating in a PhD in Quantum Information Retrieval under the supervision of Dr. Haiming Liu and Dr. Ingo Frommholz from University of Bedfordshire, where I was a Marie Skłodowska-Curie fellow within the QUARTZ ITN consortium. Following my PhD, I joined the department of Statistics as a Research Engineer within the School of Mathematics at University of Leeds, where I worked with Dr. Leonid Bogachev on rapid prototyping of statistical machine learning approaches focussed on Financial AI projects which spin-out as 4-XTRA Technologies Ltd. In April 2022, I moved to the University College London (UCL) as a Postdoctoral Research Fellow, where I worked with Prof. Alexey Zaikin and Dr. Oleg Blyuss on developing statistical machine learning and deep temporal vision models for risk stratification of Prostate cancer progression. From April 2023 onwards, I was a Postdoctoral Research Fellow at the University of Surrey, working on applied research at the intersection of Generative AI, Web 3.0, and Blockchain. Since November '24, I have transitioned to an honorary fellow.
Before graduate school, I was also a part of Google Summer of Code program and the Linux Foundation as an intern with Kubernetes, CNCF and Openstack team at the Open Mainframe Project.
Applications currently open (start date as soon as possible) for an AI research engineer internship at IIT-BHU: apply here through the interest form.
We contribute the first publicly available dataset of factual claims from different platforms and fake YouTube videos on the 2023 Israel-Hamas war for automatic fake YouTube video classification. The FakeClaim data is collected from 60 fact-checking organizations in 30 languages and enriched with metadata from the fact-checking organizations curated by trained journalists specialized in fact-checking. Further, we classify fake videos within the subset of YouTube videos using textual information and user comments. ...
Traditional neural word embeddings are usually dependent on a richer diversity of vocabulary. However, the language models recline to cover major vocabularies via the word embedding parameters, in particular, for multilingual language models that generally cover a significant part of their overall learning parameters. In this work, we present a new compact embedding structure to reduce the memory footprint of the pre-trained language models with a sacrifice of up to 4\% absolute accuracy. The embeddings vectors reconstruction follows a set of subspace embeddings and an assignment procedure via the contextual relationship among tokens from pre-trained language models. ...
This paper focuses on affective emotion recognition, aiming to perform in the subject-agnostic paradigm based on EEG signals. However, EEG signals manifest subject instability in subject-agnostic affective Brain-computer interfaces (aBCIs), which led to the problem of distributional shift. Furthermore, this problem is alleviated by approaches such as domain generalisation and domain adaptation. Typically, methods based on domain adaptation confer comparatively better results than the domain generalisation methods but demand more computational resources given new subjects. We propose a novel framework, meta-learning based augmented domain adaptation for subject-agnostic aBCIs. ...
Item representation holds significant importance in recommendation systems, which encompasses domains such as news, retail, and videos. Retrieval and ranking models utilise item representation to capture the user-item relationship based on user behaviours. While existing representation learning methods primarily focus on optimising item-based mechanisms, such as attention and sequential modelling. However, these methods lack a modelling mechanism to directly reflect user interests within the learned item representations. Consequently, these methods may be less effective in capturing user interests indirectly. To address this challenge, we propose a novel Interest-aware Capsule network (IaCN) recommendation model, a model-agnostic framework that directly learns interest-oriented item representations... ...
The paper introduces a model for interactive image retrieval utilising the geometrical framework of information retrieval (IR). We tackle the problem of image retrieval based on an expressive user information need in form of a textual-visual query, where a user is attempting to find an image similar to the picture in their mind during querying. The user information need is expressed using guided visual feedback based on Information Foraging which lets the user perception embed within the model via semantic Hilbert space (SHS). This framework is based on the mathematical formalism of quantum probabilities and aims to understand the relationship between user textual and image input, where the image in the input is considered a form of visual feedback. We propose SHS, a quantum-inspired approach where the textual-visual query is regarded analogously to a physical system that allows for modelling different... ...
Understanding an information forager's actions during interaction is very important to the study of interactive information retrieval. Although information spread in uncertain information space is substantially complex due to the high entanglement of users’ interacting with information objects (text, image, etc.) and vice versa. However, an information forager, in general, accompanies a piece of information (information diet) while searching (or foraging) alternative contents, typically subject to decisive uncertainty. ...
Query Auto-completion (QAC) is a prominently used feature in search engines, where user interaction with such explicit feature is facilitated by the possible automatic suggestion of queries based on a prefix typed by the user. Existing QAC models have pursued a little on user interaction and cannot capture a user’s information need (IN) context. ...
User implicit feedback plays an important role in recommender systems. However, finding implicit features is a tedious task. This paper aims to identify users' preferences through implicit behavioural signals for image recommendation based on the Information Scent Model of Information Foraging Theory. In the first part, we hypothesise that the users' perception is improved with visual cues in the images as behavioural signals that provide users' information scent during information seeking. ...
A major challenge of recommender systems is to help users locating interesting items. Personalized recommender systems have become very popular as they attempt to predetermine the needs of users and provide them with recommendations to personalize their navigation. However, few studies have addressed the question of what drives the users’ attention to specific content within the collection and what influences the selection of interesting items. To this end, we employ the lens of Information Foraging Theory (IFT) to image recommendation to demonstrate how the user could utilize visual bookmarks to locate interesting images. ...
Pathologists find tedious to examine the status of the sentinel lymph node on a large number of pathological scans. The examination process of such lymph node which encompasses metastasized cancer cells is histopathologically organized. However, the task of finding metastatic tissues is gradual which is often challenging. In this work, we present our deep convolutional neural network based model validated on PatchCamelyon (PCam) benchmark dataset for fundamental machine learning research in histopathology diagnosis. ...
The rich collection of annotated datasets piloted the robustness of deep learning techniques to effectuate the implementation of diverse medical imaging tasks. Over 15% of deaths include children under age five are caused by pneumonia globally. In this study, we describe our deep learning based approach for the identification and localization of pneumonia in Chest X-rays (CXRs) images. Researchers usually employ CXRs for the diagnostic imaging study. Several factors such as positioning of the patient and depth of inspiration can change the appearance of the chest X-ray, complicating interpretation further. ...
In this paper, we present an extension, and an evaluation, to existing Quantum like approaches of word embedding for IR tasks that (1) improves complex features detection of word use (e.g., syntax and semantics), (2) enhances how this method extends these aforementioned uses across linguistic contexts (i.e., to model lexical ambiguity) - specifically Question Classification -, and (3) reduces computational resources needed for training and operating Quantum based neural networks, when confronted with existing models. ...
We propose Parsec, a web-scale State channel for the Internet of Value to exterminate the consensus bottleneck in Blockchain by leveraging a network of state channels which enable to robustly transfer value off-chain. It acts as an infrastructure layer developed on top of Ethereum Blockchain, as a network protocol which allows coherent routing and interlocking channel transfers for trade-off between parties. A web-scale solution for state channels is implemented to enable a layer of value transfer to the internet. Existing network protocol on State Channels include Raiden for Ethereum and Lightning Network for Bitcoin. ...
Web/Competition Chair @ CHILites, SIGCHI'19 HASOC,FIRE Conference'20,
Reviewer @ JOSS '19-'24, IEEE TMI'20, ECIR'20, ACM ICMI'21, ECAI'23, ACMMM'23, MLGH, ICLR'23, SynS & ML, ICML'23, PLOS One
Program Committee @ ACM WebSci'25, ACM CIKM'24, ACM MM'24, ACM MM'23, ML4PS, NeurIPS '19-'22,'24