Anirban Pal

anirbanpal.wbut@gmail.com

Driving enterprise modernization through the strategic architecture of agentic AI systems, I bridge the gap between two decades of industry battle-testing and PhD-level Computer Science research. I specialize in translating emerging theory into scalable production by refactoring high-friction legacy processes into autonomous, intelligent workflows. My focus is on the strategic integration of large language models, automated behavioral analytics, and high-impact predictive systems. By prioritizing both data integrity and operational velocity, I build the resilient infrastructure necessary for an AI-first future.

Work Experience

Gallup, Inc.
Data Scientist
Oct 2022 – Present
  • Orchestrated an agentic AI framework for a federal research project, leveraging real-time Transformer-based NLP to quantify leadership dynamics and deploy live behavioral scoring within a simulated environment.
  • Architected a generative AI pipeline for global verbatim analysis, real-time translation, synthesizing NER and toxicity detection with LLM-powered thematic clustering to automate the extraction of strategic insights from unstructured survey data.
  • Developed a predictive model-backed tool to estimate survey question length across diverse languages and regions, enabling more efficient questionnaire design and accurate global project scoping.
  • Modernized World Poll legacy data infrastructure and several manual touchpoints by engineering an automated AWS-driven processing engine, accelerating global result generation by over 90% and significantly impacting profitability while ensuring data integrity at scale.
  • Engineering scalable agentic tools and LLM orchestration layers to eliminate process friction in legacy enterprise architectures, accelerating modernization through the rapid deployment of autonomous AI workflows.
VMware, Inc.
Senior Data Product Manager
Apr 2022 – Oct 2022
  • Developed a sophisticated churn and retention engine for the End User Computing (EUC) suite, using telemetry data to pinpoint friction in the customer lifecycle.
  • Architected a unified Power BI telemetry hub that consolidated usage data from 12+ global products. This standardized "single source of truth" reduced reporting lead times by 50% and empowered 15+ Product Managers to perform real-time cross-platform benchmarking.
Blue Cross Blue Shield of Nebraska
Senior Data Analytics Engineer
Aug 2014 – Apr 2022
  • Directed the enterprise-wide Power BI rollout for 200+ users, establishing a "single source of truth" for clinical and financial data that reduced manual reporting cycles by over 80%.
  • Developed a high-risk pregnancy identification model that enabled early clinical intervention, resulting in an estimated $1M+ in annual cost avoidance by reducing catastrophic neonatal claims.
  • Architected a risk-stratification framework to bracket high-risk members, increasing the efficiency of care management outreach by 25% through improved "True Positive" identification.
  • Automated manual research and reporting tasks, transforming months of legacy data processing into streamlined, repeatable Python and cloud-driven pipelines (Azure).
  • Analyzed the efficacy of "Gap in Care" outreach campaigns, leveraging predictive insights to increase HEDIS-related screening completions (e.g., Colorectal, Breast Cancer).
Oriental Trading Company
Business Intelligence and Reporting Developer
Oct 2013 – Aug 2014
  • Architected enterprise-scale dashboards and KPI frameworks in Tableau, synthesizing disparate operational data into high-visibility business metrics to drive executive decision-making.
  • Engineered robust ETL pipelines using UNIX and Netezza, serving as a data quality lead to ensure high-fidelity reporting and seamless data warehouse integration.
  • Spearheaded the enterprise-wide adoption of Tableau through cross-functional technical training, transitioning manual reporting groups into self-service analytics workflows.
Cognizant Technology Solutions
Software Developer
Sep 2012 – Oct 2013
  • Architected end-to-end Data Warehousing (EDW) and Business Intelligence solutions, leveraging Informatica, Teradata, and Oracle to transform complex legacy data into actionable executive insights.
  • Engineered and optimized high-performance database objects and complex SQL queries across Sybase and SQL Server, significantly improving data retrieval speeds for mission-critical client-server applications.
  • Designed logical and physical data models for multi-tenant architectures, ensuring data integrity and scalability across diverse industry verticals including Hospitality and Financial Services.
  • Synthesized client requirements into technical roadmaps, leading the development and deployment of customized dashboards using IBM Cognos and Business Objects.
  • Directed project estimation and RFP technical responses for new business initiatives, while mentoring junior developers on modernizing legacy C#, Perl, and Visual Basic systems.

Education

PhD in Computer Science
AI Specialization · University of Nebraska at Omaha
Research Focus: Applied AI, Socio-Technical Impact of AI
MS in Data Science
Bellevue University, Bellevue, Nebraska
BTech in Electronics & Communication
West Bengal University of Technology, WB, India

Selected Publications

"Understanding the role of Pax5 in development of taxane-resistant neuroendocrine like prostate cancers"
Cell Death and Disease — Nature Portfolio
In collaboration with the University of Nebraska Medical Center (UNMC).
Engineered a pipeline to analyze genomic sequences, contributing to high-impact biomedical research published in a top-tier peer-reviewed journal.

Professional Service & Peer Review

  • Invited Reviewer — Conferences: ECIR (Information Retrieval), UMAP (User Modeling, Adoption & Personalization), DG.O (Digital Government)
  • Invited Reviewer — Journals: PeerJ (Computer Science & Life Sciences)
  • Program Committee, Track Chair — DG.O 2026: Track 12: Digital Government for Public Health & Healthcare

Actively contributing to the global research community through publications and evaluating cutting-edge advancements in information retrieval and adaptive AI systems.