Cross-Cutting Themes

Publications

Hejna, J., Rafailov, R., Sikchi, H., Finn, C., Niekum, S., Knox, W. B., & Sadigh, D. (2024). Contrastive Preference Learning: Learning from Human Feedback without RL. https://doi.org/10.48550/ARXIV.2310.13639

Jensen, J., & Murthy, D. (2025). Communicating for collaboration in AI development teams.

Jensen, J., & Murthy, D. (2025). What is being reimagined? Creativity, aura, and generative AI as the automation of remix.

Jensen, J., Murthy, D., & Baker, Samuel (2025). Automating remix: Generative AI, creative labor, and the decay of aura.

Muslimani, C., Chandramouli, S., Booth, S., Knox, B. W., & Taylor, M. E. (2024). Analyzing Reward Functions via Trajectory Alignment. https://openreview.net/pdf?id=Shnso8m57C

Muslimani, C., Johnstonbaugh, K., Chandramouli, S., Booth, S., Knox, W. B., & Taylor, M. E. (2025). Towards Improving Reward Design in RL: A Reward Alignment Metric for RL Practitioners. https://doi.org/10.48550/arXiv.2503.05996

Rafailov, R., Chittepu, Y., Park, R., Sikchi, H., Hejna, J., Knox, B., Finn, C., & Niekum, S. (2024). Scaling Laws for Reward Model Overoptimization in Direct Alignment Algorithms. https://doi.org/10.48550/ARXIV.2406.02900

Zhang, M. J. Q., Knox, W. B., & Choi, E. (2025). Modeling Future Conversation Turns to Teach LLMs to Ask Clarifying Questions. https://doi.org/10.48550/arXiv.2410.13788