최종학력
Ph.D. In Computer Science, Yonsei University
주요 연구
- Foundation Models- Multimodal Large Language Models- Data-driven Understanding
주요 논문/저서
(* denotes equal contribution) (2025) * Heegyu Kim, Taeyang Jeon, **Seungtaek Choi**, Ji Hoon Hong, Dong Won Jeon, Ga-Yeon Baek, Gyeong-Won Kwak, Dong-Hee Lee, Jisu Bae, Chihoon Lee, Yunseo Kim, Seon-Jin Choi, Jin-Seong Park, Sung Beom Cho, Hyunsouk Cho. Towards Fully-Automated Materials Discovery via Large-Scale Synthesis Dataset and Expert-Level LLM-as-a-Judge. **CIKM 2025**. * Wansik Jo, Jooyeong Na, Soyeon Hong, **Seungtaek Choi**, Hyunsouk Cho. Rethinking the Training Paradigm of Discrete Token-Based Multimodal LLMs: An Analysis of Text-Centric Bias. CIKM 2025. * Yeonjoon Jung, **Seungtaek Choi**, Seung-won Hwang. Overcoming Source Object Grounding for Semantic Image Editing. **TACL 2025**. * Sungjun Han, Juyoung Suk, Suyeong An, Hyungguk Kim, Kyuseok Kim, Wonsuk Yang, **Seungtaek Choi**, Jamin Shin. Trillion 7B Technical Report. **arXiv 2025** (technical report of Trillion-7B-preview). * Heegyu Kim, Taeyang Jeong, Seunghwan Choi, **Seungtaek Choi**, Hyunsouk Cho. FLEX: Expert-level False-Less EXecution Metric for Reliable Text-to-SQL Benchmark. **NAACL 2025**. * Jin-Young Kim, Soonwoo Kwon, Hyojun Go, Yunsung Lee, **Seungtaek Choi**, Hyun-Gyoon Kim. ScoreCL: Augmentation-Adaptive Contrastive Learning via Score-Matching Function. **Machine Learning 2025**. (2024) * Yeonjoon Jung, Jaseseong Lee, **Seungtaek Choi**, Dohyeon Lee, Minsoo Kim, Seung-won Hwang. Interventional Speech Noise Injection for ASR Generalizable Spoken Language Understanding. **EMNLP 2024**. * Seungduk Kim*, **Seungtaek Choi***, Myeongho Jeong. Efficient and Effective Vocabulary Expansion Towards Multilingual Large Language Models. **arXiv 2024** (technical report of EEVE-Korean). * Yunsung Lee*, JinYoung Kim*, Hyojun Go*, Myeongho Jeong, Shinhyeok Oh, **Seungtaek Choi**. Multi-Architecture Multi-Expert Diffusion Models. **AAAI 2024**. (2023) * Hyojun Go*, JinYoung Kim*, Yunsung Lee*, Seunghyun Lee*, Shinhyeok Oh, Hyeongdon Moon, **Seungtaek Choi**. Addressing Negative Transfer in Diffusion Models. **NeurIPS 2023**. * Jungbae Park, **Seungtaek Choi**. Addressing Cold Start Problem for End-to-end Automatic Speech Scoring. **INTERSPEECH 2023**. * Hyun Seung Lee*, **Seungtaek Choi***, Yunsung Lee, Hyeongdon Moon, Shinhyeok Oh, Myeongho Jeong, Hyojun Go, Christian Wallraven. Cross Encoding As Augmentation: Towards Effective Educational Text Classification. **Findings of ACL 2023**. * Shinhyeok Oh*, Hyojun Go*, Hyeongdon Moon, Yunsung Lee, Myeongho Jeong, Hyun Seung Lee, **Seungtaek Choi**. Evaluation of Question Generation Needs More References. **Findings of ACL 2023**. * Yeonjoon Jung, **Seungtaek Choi**, Seung-won Hwang, Jihyuk Kim, Minji Seo, Minsoo Kim. Retrieval-augmented Instructional Video Encoding for Dense Video Captioning. **Findings of ACL 2023**. * Dohyeon Lee, Seung-won Hwang, Kyungjae Lee, **Seungtaek Choi**, Sunghyun Park. On Complementarity Objectives for Hybrid Retrieval. **ACL 2023**. * Hyojun Go*, Yunsung Lee*, JinYoung Kim*, Seunghyun Lee, Myeongho Jeong, Hyun Seung Lee, **Seungtaek Choi**. Towards Practical Plug-and-Play Diffusion Models. **CVPR 2023**. (2022) * Hyeongdon Moon*, Yoonseok Yang*, Jamin Shin, Hangyeol Yu, Seunghyun Lee, Myeongho Jeong, Juneyoung Park, Minsam Kim, **Seungtaek Choi**. Evaluating the Knowledge Dependency of Questions. **EMNLP 2022**. * Hojae Han, Seung-won Hwang, Shuai Lu, Nan Duan, **Seungtaek Choi**. Towards Compositional Generalization in Code Search. **EMNLP 2022** (short). * Minji Seo*, Yeonjoon Jung*, **Seungtaek Choi**, Seung-won Hwang, Bei Liu. Debiasing Event Understanding for Visual Commonsense Tasks. **Findings of ACL 2022**. * **Seungtaek Choi***, Myeongho Jeong*, Hojae Han, Seung-won Hwang. C2L: Causally Contrastive Learning for Robust Text Classification. **AAAI 2022**. (2021) * Jihyuk Kim, Myeongho Jeong, **Seungtaek Choi**, Seung-won Hwang. Structure-Augmented Keyphrase Generation. **EMNLP 2021**. * Hojae Han, **Seungtaek Choi**, Myeongho Jeong, Jin-woo Park, Seung-won Hwang. Counterfactual Generative Smoothing for Imbalanced Natural Language Classification. **CIKM 2021** (short). * Myeongho Jeong*, **Seungtaek Choi***, Jinyoung Yeo, Seung-won Hwang. Label and Context Augmentation for Response Selection at DSTC8. **TASLP 2021** (2nd/3rd prize at DSTC8 Track2 Sub-task1). (2020) * **Seungtaek Choi***, Myeongho Jeong*, Jinyoung Yeo, Seung-won Hwang. Label-Efficient Training for Next Response Selection. **EMNLP 2020** (workshop, SustaiNLP). * Jihyeok Kim, **Seungtaek Choi**, Reinald Kim Amplayo, Seung-won Hwang. Retrieval-Augmented Controllable Review Generation. **COLING 2020**. * **Seungtaek Choi**, Haeju Park, Jinyoung Yeo, Seung-won Hwang. Less is More: Attention Supervision with Counterfactuals for Text Classification. **EMNLP 2020**. * Myeongho Jeong*, **Seungtaek Choi***, Hojae Han, Kyungho Kim, Seung-won Hwang. Conditional Response Augmentation for Dialogue using Knowledge Distillation. **INTERSPEECH 2020**. * **Seungtaek Choi**, Haeju Park, Seung-won Hwang. Meta-Supervision for Attention using Counterfactual Estimation. **DSEJ 2020** (Highly Rated ICDM Issue Invitation). (2019) * **Seungtaek Choi**, Haeju Park, Seung-won Hwang. Counterfactual Attention Supervision. **ICDM 2019** (short). * Hojae Han*, **Seungtaek Choi***, Haeju Park, Seung-won Hwang. MICRON: Multigranular Interaction for Contextualizing Representation in Non-factoid Question Answering. **EMNLP 2019** (short). (2018) * Jinyoung Yeo, Gyungbok Lee*, Gengyu Wang*, **Seungtaek Choi**, Hyunsouk Cho, Reinald Kim Amplayo, Seung-won Hwang. Visual Choice of Plausible Alternatives: An Evaluation of Image-based Commonsense Causal Reasoning. **LREC 2018**. * Jinyoung Yeo, Gengyu Wang, Hyunsouk Cho, **Seungtaek Choi**, Seung-won Hwang. Machine-translated Knowledge Transfer for Commonsense Causal Reasoning. **AAAI 2018**.