Publications
Publications by categories in reversed chronological order. For recent publications, please visit my Google Scholar Page.
2025
- Combinatorial Multi-armed Bandits: Arm Selection via Group TestingTransactions on Machine Learning Research (TMLR), 2025
- Granite Guardian: Comprehensive LLM SafeguardingIn 2025 Annual Conference of the North American Chapter of the Association for Computational Linguistics Industry Track, 2025
- MOPI-HFRS: A Multi-objective Personalized Health-aware Food Recommendation System with LLM-enhanced InterpretationIn KDD 2025, 2025
- NGQA: a nutritional graph question answering benchmark for personalized health-aware nutritional reasoningIn Association for Computational Linguistics (ACL 2025), 2025
- Sequential uncertainty quantification with contextual tensors for social targetingKnowledge and Information Systems, 2025
- Cross-Examiner: Evaluating Consistency of Large Language Model-Generated ExplanationsarXiv preprint arXiv:2503.08815, 2025
- PEEL the Layers and Find Yourself: Revisiting Inference-Time Data Leakage for Residual Neural NetworksIn 2025 IEEE Conference on Secure and Trustworthy Machine Learning (SaTML), 2025
- Are Large Language Models Effective in Clinical Trial Design? A Study on Baseline Feature GenerationIn Findings of the Association for Computational Linguistics: NAACL 2025, 2025
- Granite Guardian: Comprehensive LLM SafeguardingIn Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 3: Industry Track), 2025
- EfficientLLM: Efficiency in Large Language ModelsarXiv preprint arXiv:2505.13840, 2025
- AutoData: A Multi-Agent System for Open Web Data CollectionarXiv preprint arXiv:2505.15859, 2025
- Generating symbolic plans using transformer-based modelsMay 2025US Patent App. 18/509,359
- Group Fair Federated Learning via Stochastic Kernel RegularizationTransactions on Machine Learning Research, May 2025
- Context Attribution with Multi-Armed Bandit OptimizationarXiv preprint arXiv:2506.19977, May 2025
- Thinking Fast and Slow in Human and Machine IntelligenceCommunications of the ACM, May 2025
- Highlight All the Phrases: Enhancing LLM Transparency through Visual Factuality IndicatorsIn ACM Conference on AI, Ethics and Society (AIES 2025), May 2025
- Language Models Coupled with Metacognition Can Outperform Reasoning ModelsarXiv preprint arXiv:2508.17959, May 2025
- The Unlearning Mirage: A Dynamic Framework for Evaluating LLM UnlearningIn Second Conference on Language Modeling, May 2025
- Protecting Users From Themselves: Safeguarding Contextual Privacy in Interactions with Conversational AgentsIn Findings of the Association for Computational Linguistics: ACL 2025, May 2025
2024
- On the prospects of incorporating large language models (llms) in automated planning and scheduling (aps)In Proceedings of the International Conference on Automated Planning and Scheduling, May 2024
- Harnessing Large Language Models for Planning: A Lab on Strategies for Success and Mitigation of PitfallsIn AAAI Conference on Artificial Intelligence, May 2024
- Provable Knowledge Transfer using Successor Feature for Deep Reinforcement LearningMay 2024
- EXPLORER: Exploration-guided Reasoning for Textual Reinforcement LearningIn European Chapter of the Association for Computational Linguistics, May 2024
- Language Guided Exploration for RL Agents in Text EnvironmentsIn ACL Findings 2024, May 2024
- Detectors for safe and reliable llms: Implementations, uses, and limitationsarXiv preprint arXiv:2403.06009, May 2024
- Variance Reduction Can Improve Trade-Off in Multi-Objective LearningIn ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), May 2024
- STARLING: Self-supervised Training of Text-based Reinforcement Learning Agent with Large Language ModelsACL 2024 Findings, May 2024
- Leveraging Visual Handicaps for Text-Based Reinforcement LearningIn ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), May 2024
- Moral High Ground: A text-based games benchmark for moral evaluationMay 2024
- On the Effects of Fine-tuning Language Models for Text-Based Reinforcement LearningEACL 2024, May 2024
- Facilitating Human-LLM Collaboration through Factuality Scores and Source AttributionsIn ACM CHI Conference on Human Factors in Computing Systems, May 2024
- SF-DQN: Provable knowledge transfer using successor feature for deep reinforcement learningIn International Conference on Machine Learning (ICML 2024), May 2024
- Ctbench: A comprehensive benchmark for evaluating language model capabilities in clinical trial designarXiv preprint arXiv:2406.17888, May 2024
- Towards Aligning Language Models with Textual FeedbackIn EMNLP 2024, May 2024
- Beyond Visual Augmentation: Investigating Bias in Multi-Modal Text GenerationIn NAACL 2024 Workshop on Trustworthy Natural Language Processing (NAACL TrustNLP 2024), May 2024
- Protecting Users From Themselves: Safeguarding Contextual Privacy in Interactions with Conversational AgentsIn NeurIPS 2024 Workshop on Socially Responsible Language Modeling Research (SoLaR), May 2024
- Examining Trustworthiness of LLM-as-a-Judge Systems in a Clinical Trial Design BenchmarkIn 2024 IEEE International Conference on Big Data (BigData), May 2024
2023
- Plansformer: Generating Multi-Domain Symbolic Plans using TransformersMay 2023
- Mitigating Gradient Bias in Multi-objective Learning: A Provably Convergent ApproachIn Proceeding of 11th International Conference on Learning Representations (ICLR 2023), May 2023
- Fast and slow planningarXiv preprint arXiv:2303.04283, May 2023
- Understanding the capabilities of large language models for automated planningarXiv preprint arXiv:2305.16151, May 2023
- Complexworld: A large language model-based interactive fiction learning environment for text-based reinforcement learning agentsIn International Joint Conference on Artificial Intelligence 2023 Workshop on Knowledge-Based Compositional Generalization, May 2023
- MISMATCH: Fine-grained Evaluation of Machine-generated Text with Mismatch Error TypesACL 2023 (Findings), May 2023
- Learning Symbolic Rules over Abstract Meaning Representations for Textual Reinforcement LearningACL 2023, May 2023
- Value-based fast and slow ai nudgingarXiv preprint arXiv:2307.07628, May 2023
- Probabilistic Rule Induction from Event Sequences with Logical Summary Markov Models.In IJCAI, May 2023
- Plansformer Tool: Demonstrating Generation of Symbolic Plans Using Transformers.In IJCAI, May 2023
- On the convergence and sample complexity analysis of deep q-networks with $\backslashepsilon $-greedy explorationAdvances in Neural Information Processing Systems, May 2023
- Plan-SOFAI: A neuro-symbolic planning architectureIn Neuro-Symbolic Learning and Reasoning in the era of Large Language Models, May 2023
- To Transfer or Not to Transfer: Suppressing Concepts from Source RepresentationsTransactions on Machine Learning Research, May 2023
2022
- Eye of the Beholder: Improved Relation Generalization for Text-based Reinforcement Learning AgentsIn Proceedings of the 36th AAAI Conference on Artificial Intelligence, May 2022
- Case-based Reasoning for Better Generalization in Text-Adventure GamesIn Proceedings of 10th International Conference on Learning Representations (ICLR 2022), May 2022
- Auto-Transfer: Learning to Route Transferrable RepresentationsIn Proceedings of 10th International Conference on Learning Representations (ICLR 2022), May 2022
- SCERL: A Text-based Safety Benchmark for Reinforcement Learning ProblemsMay 2022
- VISUALHANDICAPS: Systematic Handicaps for Text-based GamesMay 2022
- SCERL: A Benchmark for intersecting language and safe reinforcement learningIn Second Workshop on Language and Reinforcement Learning, May 2022
- Mitigating gradient bias in multi-objective learning: A provably convergent stochastic approacharXiv preprint arXiv:2210.12624, May 2022
- A Text-based Safety Benchmark for Reinforcement Learning ProblemsIn Annual Conference on Neural Information Processing Systems, May 2022
- Targeted advertising on social networks using online variational tensor regressionarXiv preprint arXiv:2208.10627, May 2022
- Influence maximization on social networks with tensor banditsApr 2022US Patent App. 17/069,829
- Controllable Concept Transfer of Intermediate RepresentationsApr 2022
- X-factor: A cross-metric evaluation of factual correctness in abstractive summarizationIn Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, Apr 2022
- Plansformer: Generating symbolic plans using transformersarXiv preprint arXiv:2212.08681, Apr 2022
2021
- Efficient Text-based Reinforcement Learning by Jointly Leveraging State and Commonsense Graph RepresentationsIn Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics (ACL), Apr 2021
- A hybrid neuro-symbolic approach for text-based games using inductive logic programmingIn Combining learning and reasoning: programming languages, formalisms, and representations, Apr 2021
- Epistemic Planning in a Fast and Slow Setting.In TFSOCTAI@ AAAI Fall Symposium, Apr 2021
2020
- Enhancing text-based reinforcement learning agents with commonsense knowledgearXiv preprint arXiv:2005.00811, Apr 2020
- Thinking fast and slow in ai (2020)arXiv preprint arXiv:2010.06002, Apr 2020
2018
- Scalable Multitask and Lifelong LearningApr 2018
- Lifelong Learning with Output KernelsApr 2018
2017
- Multi-Task Multiple Kernel Relationship LearningIn Proceedings of the 17th SIAM International Conference on Data Mining (SDM 2017), Houston, Texas, USA, 2017, Apr 2017
- Self-Paced Multitask Learning with Shared KnowledgeIn Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI-17), Melbourne, Australia, 2017., Apr 2017
- Co-clustering for multitask learningarXiv preprint arXiv:1703.00994, Apr 2017
- Active learning from peersAdvances in Neural Information Processing Systems, Apr 2017
- Online and adaptive methods for multitask learningCarnegie Mellon University, Apr 2017
- Multitask matrix completion for learning protein protein interactions across diseasesJournal of Computational Biology, Apr 2017
2016
- Adaptive Smoothed Online Multi-Task LearningIn Advances in Neural Information Processing Systems (NIPS 2016), Apr 2016
- Multitask Matrix Completion for Learning Protein Interactions Across DiseasesIn International Conference on Research in Computational Molecular Biology (RECOMB), Apr 2016
2015
- Predicting Workplace Incidents with Temporal Graph-guided Fused LassoApr 2015
2013
- Learning Latent Tree Structure for Natural LanguageApr 2013
2011
- Hybrid hierarchical clustering: An experimental analysisUniversity of Kentucky, Lexington, Technical Report: CMIDA-HiPSCCS, Apr 2011
- Hybrid bisect K-means clustering algorithmIn 2011 International Conference on Business Computing and Global Informatization, Apr 2011
- A New Term Weighting Scheme for Document ClusteringApr 2011
- Cluster-Based Term Weighting and Document Ranking ModelsUniversity of Kentucky, Apr 2011
- A New Term Weighting Scheme for Document ClusteringIn DMIN 2011: proceedings of the 2011 international conference on data mining (Las Vegas NV, July 18-21, 2011), Apr 2011