EXPLORING AI-DRIVEN PROGRAMMING EXERCISE GENERATION

J. Rytilahti, O. Weerakoon, E. Kaila (2024). EXPLORING AI-DRIVEN PROGRAMMING EXERCISE GENERATION.

2024 20th International CDIO Conference, ESPRIT, Tunisia

Large language models(LLMs) are transforming how teachers work. In this paper, we observe several experimental approaches to generating software programming exercises by utilizing ChatGPT, a popular and open LLM. The generation of these exercises was tightly connected to a large Python programming course that was targeted at students studying in Information Technology, Software Engineering, and Computing.

We experimented with three separate approaches. In the first one, we generated new programming exercises with a specific topic using theme injection. In the second one, we generated variations of existing programming exercises by changing the theme or content. In the third one, we generated hybrid exercises by injecting original programming exercises with additional topics or other related exercises.

Based on our results, all three approaches showed potential but also revealed limitations. The exercise generation with theme injection can produce fully functional exercises. However, these exercises could appear to students as too generic or erroneous. The exercise variations seem to retain the semantic meaning of the original exercise quite well while still using different context. We also tested the variations in a large introductory programming course and found out that the students could not distinguish them from human-generated exercises in style or quality. The hybrid exercises were built upon the idea of exploring how close we are to fully adaptive learning environments in the field of programming education. The current results of this approach show that we need to do further experimentation to maybe reach the goal.

All in all, it was evident that LLMs can be a useful tool in assisting teachers in generating exercises. Even with certain coherent limitations, they are useful in particular cases. We conclude our article by discussing the future possibilities of LLMs, including but not limited to dynamic, automatically generated exercises and fully adaptive learning environments.

Authors (New):

Juuso Rytilahti

Oshani Weerakoon

Erkki Kaila

Affiliations:

University of Turku, Turku, Finland

Keywords:

AI in education

Large Language Models (LLM)

Programming Education

Pedagogical Tools

ChatGPT

CDIO Standard 2

CDIO Standard 3

CDIO Standard 5

CDIO Standard 8

CDIO Standard 9

CDIO Standard 11

Year:

2024

Reference:

Bubeck, S., Chandrasekaran, V., Eldan, R., Gehrke, J., Horvitz, E., Kamar, E., . . . Zhang, Y. (2023). Sparks of artificial general intelligence: Early experiments with gpt-4. arXiv preprint arXiv:2303.12712.:

https://doi.org/10.48550/arXiv.2303.12712

Crawford, J., Cowling, M., & Allen, K.-A. (2023). Leadership is needed for ethical chatgpt: Character, assessment, and learning using artificial intelligence (ai). Journal of University Teaching and Learning Practice. doi: 10.53761/1.20.3.02:

https://doi.org/10.53761/1.20.3.02

Dave, P. (2023, May 31). Chatgpt is cutting non-english languages out of the ai revolution. https://www.wired.com/story/chatgpt-non-english-languages-ai -revolution/. WIRED.:

Deng, J., & Lin, Y. (2022). The benefits and challenges of chatgpt: An overview. Frontiers in Computing and Intelligent Systems, 2(2), 81–83.:

https://doi.org/10.54097/fcis.v2i2.4465

Dettmers, T., Pagnoni, A., Holtzman, A., & Zettlemoyer, L. (2023). Qlora: Efficient finetuning of quantized llms. arXiv preprint arXiv:2305.14314.:

https://doi.org/10.48550/arXiv.2305.14314

Nelson, K., & Creagh, T. (2023). Editorial volume 14 issue 1 2023. Student Success. doi: 10.5204/ssj.2860:

https://doi.org/10.5204/ssj.2860

OpenAI. (2023). Gpt-4 technical report.:

Ouyang, L., Wu, J., Jiang, X., Almeida, D., Wainwright, C., Mishkin, P., . . . Lowe, R. (2022). Training language models to follow instructions with human feedback. Advances in Neural Information Processing Systems, 35, 27730–27744.:

https://doi.org/10.48550/arXiv.2203.02155

Sovietov, P. (2022). Automatic generation of programming exercises. arXiv.:

https://doi.org/10.1109/TELE52840.2021.9482762

Speth, S., Meissner, N., & Becker, S. (2023). Investigating the use of ai-generated exercises for beginner and intermediate programming courses: A chatgpt case study. In Csee&t 2023. (This study explores the use of AI, particularly ChatGPT, in creating programming exercises and assesses their effectiveness in an educational setting.):

http://dx.doi.org/10.1109/CSEET58097.2023.00030

Team, G., Anil, R., Borgeaud, S., Wu, Y., Alayrac, J.-B., Yu, J., . . . Ahn, J. (2023). Gemini: a family of highly capable multimodal models. arXiv preprint arXiv:2312.11805.:

https://doi.org/10.48550/arXiv.2312.11805

Wang, K., Singh, R., & Su, Z. (n.d.). Search, align, and repair: Data-driven feedback generation for introductory programming exercises. https://arxiv.org/. Retrieved from https://arxiv .org/pdf/1711.07148.pdf:

https://doi.org/10.48550/arXiv.1711.07148

on July 4, 2024 - 08:10 Anonymous (not verified)

EXPLORING AI-DRIVEN PROGRAMMING EXERCISE GENERATION

Navigation

EXPLORING AI-DRIVEN PROGRAMMING EXERCISE GENERATION

Share this content.