Proposes PA-Tool, a training-free method that renames tool schema components to better match names small language models encountered during pretraining, achieving up to a 17% improvement on tool-use benchmarks without model retraining.
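A minimal sketch of the schema-renaming idea. The mapping, function names, and schema layout below are all hypothetical illustrations, not taken from the PA-Tool paper: the point is only that terse or idiosyncratic identifiers get replaced by aliases a small model has likely seen often in pretraining.

```python
# Hypothetical alias table -- not the paper's actual renaming strategy.
FAMILIAR_NAMES = {
    "exec_fn": "search",
    "qry_str": "query",
    "max_res_cnt": "max_results",
}

def rename_schema(tool_schema: dict, mapping: dict) -> dict:
    """Return a copy of a tool schema with the function name and
    parameter names replaced by more pretraining-familiar aliases."""
    return {
        "name": mapping.get(tool_schema["name"], tool_schema["name"]),
        "parameters": {
            mapping.get(k, k): v
            for k, v in tool_schema["parameters"].items()
        },
    }

schema = {"name": "exec_fn",
          "parameters": {"qry_str": "string", "max_res_cnt": "int"}}
print(rename_schema(schema, FAMILIAR_NAMES))
# → {'name': 'search', 'parameters': {'query': 'string', 'max_results': 'int'}}
```

Because the transformation touches only the schema text handed to the model, no weights change, which is what makes the approach training-free.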
Shows that psychological profiles of LLMs obtained via standard human questionnaires diverge substantially from profiles derived from their actual generation behavior, suggesting questionnaire-based assessments reflect desired rather than genuine traits.
Introduces a user simulator that reproduces realistic non-collaborative behaviors (requesting unavailable services, digressing, expressing impatience), revealing that state-of-the-art tool agents suffer significant performance drops under such conditions.
Uses LLMs as virtual respondents for validating psychometric survey items by modeling trait-response mediators, demonstrating cost-effective item validation across Big Five personality, Schwartz values, and VIA character strengths.
Proposes a systematic framework to measure data contamination in psychometric LLM evaluations across item memorization, evaluation memorization, and target score matching, finding strong contamination in popular inventories like BFI and PVQ.
Proposes the Value Portrait framework for evaluating LLM values using items from real user-LLM interactions, finding across 44 LLMs that they emphasize Benevolence, Security, and Self-Direction while undervaluing Tradition, Power, and Achievement.
Presents an LLM-based system for generating English reading-comprehension questions for the Korean college entrance exam, combining abductive reasoning with RAG and knowledge distillation.
Proposes a cryptographic access control technique that encrypts AI model parameters to prevent unauthorized access to lightweight on-device models, using the ELU activation function to handle nonlinear operations under homomorphic encryption.
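A brief, illustrative note on why the activation choice matters here: homomorphic encryption schemes such as CKKS evaluate only additions and multiplications, so a nonlinear activation must be replaced by a low-degree polynomial before it can run on ciphertexts. The sketch below (not the paper's method; the interval and polynomial degree are arbitrary choices) shows that a smooth activation like ELU fits a low-degree polynomial reasonably well, whereas ReLU's kink at zero makes this harder.

```python
import numpy as np

def elu(x, alpha=1.0):
    # ELU: x for x > 0, alpha * (exp(x) - 1) for x <= 0.
    # Smooth at 0, unlike ReLU, so easier to approximate polynomially.
    return np.where(x > 0, x, alpha * (np.exp(x) - 1))

# Fit a degree-4 polynomial to ELU on [-4, 4] by least squares.
xs = np.linspace(-4, 4, 401)
coeffs = np.polyfit(xs, elu(xs), deg=4)
poly = np.poly1d(coeffs)

max_err = np.max(np.abs(poly(xs) - elu(xs)))
print(f"max abs error on [-4, 4]: {max_err:.3f}")
```

Under encryption, only `poly` (pure additions and multiplications) would be evaluated in place of the exact activation.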