Workshop
The 3rd DL4C Workshop: Emergent Possibilities and Challenges in Deep Learning for Code
Zijian Wang · Ying Sheng · Giovanni Zappella · Qian Liu · Devjeet Roy · Gabriel Orlanski · Zora Zhiruo Wang · Wen-Ding Li
Garnet 218-219
Sun 27 Apr, 5:30 p.m. PDT
Timezone: America/Los_Angeles
Schedule
Sun 5:30 p.m. - 6:00 p.m.
|
Poster Setup
(
Poster Setup
)
>
|
🔗 |
Sun 6:00 p.m. - 6:20 p.m.
|
Opening remarks
(
Intro
)
>
SlidesLive Video |
🔗 |
Sun 6:20 p.m. - 7:00 p.m.
|
Invited Talk: Inducing Functions to Improve LLM Agents by Daniel Fried
(
Invited talk
)
>
SlidesLive Video |
Daniel Fried 🔗 |
Sun 7:30 p.m. - 8:10 p.m.
|
Invited Talk: Multimodal Code Generation for Embodied AI Agents by Tao Yu
(
Invited talk
)
>
SlidesLive Video |
Tao Yu 🔗 |
Sun 8:10 p.m. - 8:50 p.m.
|
Invited Talk: Designing, Building, and Training Effective Software Engineering Agents by Xingyao Wang
(
Invited talk
)
>
SlidesLive Video |
Xingyao Wang 🔗 |
Sun 8:50 p.m. - 9:10 p.m.
|
Spotlights
SlidesLive Video |
🔗 |
Sun 10:30 p.m. - 11:10 p.m.
|
Invited Talk: AI for Software Engineering: Where are we now, and what lies ahead? by Alex Gu
(
Invited talk
)
>
SlidesLive Video |
Alex Gu 🔗 |
Sun 11:10 p.m. - 11:50 p.m.
|
Invited Talk: The Future of Multimodal AI Applications by Stefania Druga
(
Invited talk
)
>
SlidesLive Video |
Stefania Druga 🔗 |
Mon 12:00 a.m. - 1:30 a.m.
|
Poster session
|
🔗 |
Mon 1:30 a.m. - 2:10 a.m.
|
Invited Talk: From code completion to agentic tasks by Baptiste Rozière
(
Invited talk
)
>
SlidesLive Video |
Baptiste Rozière 🔗 |
Mon 2:10 a.m. - 2:20 a.m.
|
Closing
SlidesLive Video |
🔗 |
-
|
Contextual Augmented Multi-Model Programming (CAMP): A Local-Cloud Copilot Solution ( Poster ) > link | Yuchen Wang · Shangxin Guo · Chee Wei Tan 🔗 |
-
|
ML-Dev-Bench: Comparative Analysis of AI Agents on ML development workflows ( Poster ) > link | Harshith Padigela · Chintan Shah · Dinkar Juyal 🔗 |
-
|
Themisto: Jupyter-Based Runtime Benchmark ( Poster ) > link | Konstantin Grotov · Sergey Titov 🔗 |
-
|
Toward Trustworthy Neural Program Synthesis ( Poster ) > link | Wen-Ding Li · Darren Key · Kevin Ellis 🔗 |
-
|
Optimizing Small Language Models for NL2SQL ( Poster ) > link | Wenqi Pei · Xu Hailing · Henry Zhao · Chen Han · Zining Zhang · Shizheng Hou · Luo Pingyi · Bingsheng He 🔗 |
-
|
DISC: Dynamic Decomposition Improves LLM Inference Scaling ( Poster ) > link | Jonathan Light · Wei Cheng · Yue Wu · Masafumi Oyamada · Mengdi Wang · Santiago Paternain · Haifeng Chen 🔗 |
-
|
Improving Automated Issue Resolution via Comprehensive Repository Exploration ( Poster ) > link | Ma Yingwei · Yue Liu 🔗 |
-
|
On Pretraining For Project-Level Code Completion ( Poster ) > link | Maksim Sapronov · Evgenii Glukhov 🔗 |
-
|
Tasks, Challenges, and Paths Towards AI for Software Engineering ( Poster ) > link | Alex Gu · Naman Jain · Wen-Ding Li · Manish Shetty · Kevin Ellis · Koushik Sen · Armando Solar-Lezama 🔗 |
-
|
GenePrune: Automated Pruning of Large Language Models for Code using Genetic Algorithm ( Poster ) > link | Nikhil Reddy Varimalla · Ruturaj Godse 🔗 |
-
|
From Pseudo-Code to Source Code: A Self-Supervised Search Approach ( Poster ) > link | Adithya Kulkarni · Mohna Chakraborty · Yonas Sium · Sai Valluri · Wei Le · Qi Li 🔗 |
-
|
NextCoder: Robust Adaptation of Code LMs to Diverse Code Edits
(
Poster
)
>
link
SlidesLive Video |
Tushar Aggarwal · Swayam Singh · Abhijeet Awasthi · Aditya Kanade · Nagarajan Natarajan 🔗 |
-
|
GRAIL: Graph Edit Distance and Node Alignment using LLM-Generated Code ( Poster ) > link | Samidha Verma · Arushi Goyal · Ananya Mathur · Ankit Anand · Sayan Ranu 🔗 |
-
|
KernelBench: Can LLMs Write Efficient GPU Kernels?
(
Spotlight
)
>
link
SlidesLive Video |
Anne Ouyang · Simon Guo · Simran Arora · Alex Zhang · William Hu · Christopher Re · Azalia Mirhoseini 🔗 |
-
|
Training Software Engineering Agents and Verifiers with SWE-Gym ( Poster ) > link | Jiayi Pan · Xingyao Wang · Graham Neubig · Navdeep Jaitly · Heng Ji · Alane Suhr · Yizhe Zhang 🔗 |
-
|
Arctic-SnowCoder: Demystifying High-Quality Data in Code Pretraining ( Poster ) > link | Yuxiang Wei · Hojae Han · Rajhans Samdani 🔗 |
-
|
EnvBench: A Benchmark for Automated Environment Setup ( Poster ) > link | Aleksandra Eliseeva · Alexander Kovrigin · Ilia Kholkin · Egor Bogomolov · Yaroslav Zharov 🔗 |
-
|
SWE-Fixer: Training Open-Source LLMs for Effective and Efficient GitHub Issue Resolution ( Poster ) > link | Chengxing Xie · Bowen Li · Chang Gao · He Du · Wai Lam · Difan Zou · Kai Chen 🔗 |
-
|
Code2JSON: Can a Zero-Shot LLM Agent Extract Code Features for Code RAG?
(
Poster
)
>
link
SlidesLive Video |
Aryan Singhal · Rajat Ghosh · Ria Mundra · Harshil Dadlani · Debojyoti Dutta 🔗 |
-
|
CLOVER: A Test Case Generation Benchmark with Coverage, Long-Context, and Verification ( Poster ) > link | Jiacheng Xu · Bo Pang · Jin Qu · Hiroaki Hayashi · Caiming Xiong · Yingbo Zhou 🔗 |
-
|
Diagnosing Robotics Systems Issues with Large Language Models – A Case Study ( Poster ) > link | Jordis Herrmann · Aswath Gopinath · Mikael Norrlof · Mark Mueller 🔗 |
-
|
LLM Program Optimization via Retrieval Augmented Search
(
Poster
)
>
link
SlidesLive Video |
Sagnik Anupam · Alexander Shypula · Osbert Bastani 🔗 |
-
|
Do LLMs Understand Code Preference? Training Code Preference Models via Synthetic Code Evolution ( Poster ) > link | Jiawei Liu · Thanh Nguyen · Mingyue Shang · Hantian Ding · Xiaopeng Li · Yu Yu · Varun Kumar · Zijian Wang 🔗 |
-
|
Parameter-Efficient Instruction Tuning Code Large Language Models: An Empirical Study ( Poster ) > link | Terry Yue Zhuo · Armel Zebaze · Leandro Von Werra · Harm de Vries · Qian Liu · Niklas Muennighoff 🔗 |
-
|
LoRACode: LoRA Adapters for Code Embeddings ( Poster ) > link | Saumya Chaturvedi · Aman Chadha · Laurent Bindschaedler 🔗 |
-
|
Adaptive Self-improvement LLM Agentic System for ML Library Development
(
Spotlight
)
>
link
SlidesLive Video |
Genghan Zhang · Victor Weixin Liang · Olivia Hsu · Kunle Olukotun 🔗 |
-
|
Black-Box Adversarial Attacks on LLM-Based Code Completion ( Poster ) > link | Slobodan Jenko · Niels Mündler · Jingxuan He · Mark Vero · Martin Vechev 🔗 |
-
|
Programming with Pixels: Towards Generalist Software Engineering Agents ( Poster ) > link | Pranjal Aggarwal · Sean Welleck 🔗 |
-
|
TypyBench: Evaluating LLM Type Inference for Untyped Python Repositories
(
Poster
)
>
link
SlidesLive Video |
Yuhe Jiang · Xun Deng · Jiacheng Yang · Honghua Dong · Gennady Pekhimenko · Fan Long · Xujie Si 🔗 |
-
|
ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code ( Poster ) > link |
22 presenters: Xiangru Tang · Yuliang Liu · Zefan Cai · Daniel Shao · Junjie Lu · Yichi Zhang · Zexuan Deng · Helan Hu · Kaikai An · Ruijun Huang · Shuzheng Si · Chen Sheng · Haozhe Zhao · Liang Chen · Tianyu Liu · Yujia Qin · Wangchunshu Zhou · Yilun Zhao · Zhiwei Jiang · Baobao Chang · Arman Cohan · Mark Gerstein |
-
|
InterTrans: Leveraging Transitive Intermediate Translations to Enhance LLM-based Code Translation ( Poster ) > link | Marcos Macedo · Yuan Tian · Pengyu Nie · Filipe Cogo · Bram Adams 🔗 |
-
|
Generate-Feedback-Refine: How Much Does Model Quality in Each Role Matter? ( Poster ) > link | Xiang Pan · Jason Phang · Guy Davidson · Ethan Perez 🔗 |
-
|
One Model to Train Them All: Hierarchical Self-Distillation for Enhanced Early Layer Embeddings
(
Poster
)
>
link
SlidesLive Video |
Andrea Gurioli · Federico Pennino · Joao Monteiro · Maurizio Gabbrielli 🔗 |
-
|
Type-Aware Constraining for Code LLMs ( Poster ) > link | Niels Mündler · Jingxuan He · Hao Wang · Koushik Sen · Dawn Song · Martin Vechev 🔗 |
-
|
Teaching Language Models to Critique via Reinforcement Learning ( Poster ) > link | Zhihui Xie · Jie Chen · Liyu Chen · Weichao Mao · Jingjing Xu · Lingpeng Kong 🔗 |
-
|
Automated Benchmark Generation for Repository-Level Coding Tasks ( Poster ) > link | Konstantinos Vergopoulos · Mark Mueller · Martin Vechev 🔗 |
-
|
CodeTransEngine: Ready-to-use Backend for LLM-based Code Translation ( Poster ) > link | Marcos Macedo · Yuan Tian · Bram Adams 🔗 |
-
|
Cracking the Code of Action: A Generative Approach to Affordances for Reinforcement Learning ( Poster ) > link | Lynn Cherif · Flemming Kondrup · David Venuto · Ankit Anand · Doina Precup · Khimya Khetarpal 🔗 |
-
|
CodeEditorBench: Evaluating Code Editing Capability of LLMs ( Poster ) > link |
16 presenters: Jiawei Guo · Ziming Li · Xueling Liu · Kaijing Ma · Tianyu Zheng · Zhouliang Yu · Ding Pan · Yizhi Li · Ruibo Liu · Yue Wang · Shuyue Guo · Xingwei Qu · Xiang Yue · Ge Zhang · Wenhu Chen · Jie Fu |
-
|
Evolving RL: Discovering New Activation Functions using LLMs ( Poster ) > link | Kalyan V Nadimpalli · Shashank Reddy Chirra · Pradeep Varakantham · Stefan Bauer 🔗 |
-
|
Does Instruction Tuning Reduce Diversity? A Case Study Using Code Generation ( Poster ) > link | Alexander Shypula · Shuo Li · Botong Zhang · Vishakh Padmakumar · Kayo Yin · Osbert Bastani 🔗 |
-
|
Shedding Light on Task Decomposition in Program Synthesis: The Driving Force of the Synthesizer Model ( Poster ) > link | Janis Zenkner · Tobias Sesterhenn · Christian Bartelt 🔗 |
-
|
WebMMU: A Benchmark for Multimodal Multilingual Website Understanding and Code Generation ( Poster ) > link |
13 presenters: Rabiul Awal · Mahsa Massoud · Zichao Li · Aarash Feizi · Suyuchen Wang · Christopher Pal · Aishwarya Agrawal · David Vazquez · Siva Reddy · Juan A. Rodriguez · Perouz Taslakian · Spandana Gella · Sai Rajeswar |
-
|
BaxBench: Can LLMs Generate Correct and Secure Backends? ( Poster ) > link | Mark Vero · Niels Mündler · Victor Chibotaru · Veselin Raychev · Maximilian Baader · Nikola Jovanović · Jingxuan He · Martin Vechev 🔗 |
-
|
Generating Code to Verify Cryptic Crossword Reasoning
(
Poster
)
>
link
SlidesLive Video |
Martin Andrews · Sam Witteveen 🔗 |
-
|
AutoKaggle: A Multi-Agent Framework for Autonomous Data Science Competitions ( Poster ) > link |
14 presenters: Ziming Li · Qianbo Zang · David Ma · Jiawei Guo · Tianyu Zheng · Minghao Liu · Xinyao Niu · Yue Wang · Jian Yang · Jiaheng Liu · Wanjun Zhong · Wangchunshu Zhou · Wenhao Huang · Ge Zhang |