I work at the intersection of AI and software engineering, building neurosymbolic tools that pair large language models with static program analysis and formal verification to make software more reliable β advised by Dr. Umar Farooq at Louisiana State University.
|
Understanding and Detecting Platform-Specific Violations in Android Auto Apps
ποΈ IEEE/ACM AST 2026 β Oral Presentation Built AutoComply, a static analysis framework using a Car-Control Flow Graph (CCFG) to detect compliance violations in Android Auto apps. Detected 27 violations across 31 apps β 13Γ more than Android Lint. |
|
MobileDev-Bench: A Benchmark for Issue Resolution in Mobile Application Development
β³ In Submission A benchmark of 407 real-world issue-resolution tasks across 19 production mobile apps (Android, React Native, Flutter). Frontier LLMs achieve only 3.39β5.21% resolution rates, exposing a critical gap in mobile SE capability. |
|
How Far Can LLMs Go in Generating Android Lint Checks from Natural Language?
β³ In Submission LintBench, a 113-check benchmark evaluating LLMs on generating Android Lint checks from natural language. Across four LLMs, API retrieval and execution-guided repair raise success from 30%β65% to 57%β85%. Yet most compiled checks that fail miss required issues, making correct issue-detection logic the central challenge. |
|
Understanding Bugs in Quantum Simulators: An Empirical Study
β³ In Submission An empirical study of 394 confirmed bugs across 12 open-source quantum simulators. We find that most failures surface post-deployment, logical bugs often produce silently incorrect outputs, and many critical issues trace back to classical infrastructure rather than quantum logic itself. |
π¬ Working on VeriLens, a neurosymbolic study pairing LLM-synthesized specifications with the SnaKt/Viper/Z3 formal verifier
π§βπ« Mentoring undergraduate students on Kotlin and Java static analysis research
πΌ Open to Summer 2027 internships




