Work

  • Nov 2020 - present

    PACT Pharma (South SF, CA)

    Bioinformatics Data Scientist

    PACT Pharma is a clinical stage immuno-oncology company dedicated to engineering personalized TCR T-cell therapies. As a Bioinformatics data scientist, I work on multiple projects including but not limited to selecting neoantigens to target, building predictive models, analyzing clinical data, developing cloud scalable pipelines etc.

  • Jun-Sep 2019

    Adaptive Biotechnologies (Seattle, WA)

    Computational Biology Intern, Antigen Map

    Antigen Map is a collaborative project of Adaptive Biotech and Microsoft Research where the goal is identifying disease-associated T-Cell receptor sequences and repertoire features to help build immune-based diagnostics. While I was an intern in the Antigen Map team, I developed a novel computational method (Ordered Clustering of Clones using weak Classifiers or OC3), for the identification of TCR-antigen pairs that bind, from data generated by Adaptive's MIRA assay. OC3 comprises feature generation, using maximum-likelihood for ordered clustering using those features, and cross-validation to select the optimal features. I built a Python package (oc3-py) that includes scalable implementation of the method and easy-to-use CLI commands. Using oc3-py, we identified tens of thousands of antigen-specific TCRs by analyzing data from multiple MIRA experiments.

Education

    Doctorate of Philosophy

    Dept. of ECE (ML and data science), UC San Diego 2020

    Master of Engineering

    Dept. of ECE, Indian Institute of Science, Bangalore 2015

    Bachelor of Engineering

    Dept. of ECE, PEC University of Technology, Chandigarh (India) 2013

PhD research

  • MINING-D: A tool for de novo inference of immunoglobulin D genes

    (Joint work with Pavel Pevzner, Yana Safonova , Massimo Franceschetti, and Ramesh Rao)

    Antibodies provide specific binding to an enormous range of antigens and represent a key component of the adaptive immune system. With immunosequencing, we can sample antibody repertoires and generate millions of reads that can provide insights into monitoring immune response to disease and vaccination. Most immunogenomics studies rely on the reference germline genes rather than the germline genes in a specific patient. This is deficient as the set of known germline V, D, and J genes is incomplete (particularly for non-Europeans humans and non-human species) and contains alleles that resulted from sequencing and annotation errors. We made a tool for de novo inference of germline D genes of a patient using their immunosequencing data.

    We first fomulated this problem as a mathematical string reconstruction problem and proposed an algorithm to find the optimal solution in linear time using dynamic programming. Based on this, we developed the heuristic algorithm, Method for Inference of Immunoglobulin Genes -D (MINING-D), that additionally takes into account the complexities of the real data ignored by the string reconstruction problem. Using MINING-D, we inferred 25 novel D genes across 5 species on ~600 publicly available immunosequencing datasets and validated the inferred novel D genes by analyzing ~100 diverse WGS datasets. We also revealed D genes that are potentially associated with antigen-specific response and showed that heterozygous D genes can be used for V gene haplotyping.

  • Analysis of Fatty acid hydroxy fatty acids (FAHFAs) in large human cohorts.

    (Joint work with Jain Lab (UCSD School of Medicine) and Shriram Nallamshetty)

    In a joint work with Jain lab at UCSD School of Medicine, we examined the role of FAHFAs - a class of bioactive lipids that exert anti-diabetic effects in preclinical models - in human diseases, and investigated their regulation with various physiological conditions including fasting, feeding, or specific dietary interventions. We analyzed metabolomics and phenotype data from multiple large human cohorts with a total of ~20,000 human subjects. The results will be posted as a preprint soon.

  • Understanding the relationships among PTSD, depression, hostility, anger, and aggression in post-combat veterans.

    (Joint work with Dewleen Baker, Abigail Angkaw, Massimo Franceschetti, and Ramesh Rao)

    In a joint work with VA Healthcare system and UCSD School of Medicine, we studied the influence of war on the mental health of veterans. Post-traumatic stress disorder has been linked with negative outcomes like hostility, depression, physical aggression, and suicidal tendencies. These are found to occur more often among veterans with PTSD than among civilians or veterans without PTSD. We analyzed clinical mental health data from a sample of veterans returning from Iraq or Afghanistan to better understand the inter-connections between these.

Publications