Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Evaluating the Code Quality of iOS Applications Generated by Large Language Models
Jönköping University, School of Engineering, JTH, Department of Computer Science and Informatics. Jönköping University, School of Engineering, JTH, Department of Computing.
2024 (English)Independent thesis Basic level (university diploma), 180 HE creditsStudent thesis
Abstract [en]

The rise of Large Language Models (LLMs) over the past years and their use in software development has raised questions and concerns about the quality of the code they generate. While LLMs such as ChatGPT have the potential to accelerate the development of new software and boost code productivity, their ability to generate high-quality code is debatable. This study presents an investigation into the code quality that is generated by LLMs, specifically focusing on iOS applications generated by ChatGPT.

To investigate the code quality of the generated iOS applications, the study implemented three movie applications in Swift with ChatGPT-3.5 and ChatGPT-4, along with a similar application written by a human developer. To measure the code quality, a set of code quality metrics was chosen to make a fair evaluation of the results.

The findings showed that ChatGPT can generate iOS applications with code quality comparable to a human-written application, and ChatGPT-4 in particular showed this capability. However, the ChatGPT-3.5 model showed inconsistent results, suggesting potential limitations in the current model. The level of human intervention required for development varies depending on the complexity of the task, with simpler tasks requiring less intervention. Overall, the study suggests that human intervention and guidance are essential for developing a working application.

Keywords: Code generation, Code Quality, Generative AI, iOS code quality, iOS development, Large Language Models

Place, publisher, year, edition, pages
2024. , p. 67
Keywords [en]
Code generation, Code Quality, Generative AI, iOS code quality, iOS development, Large Language Models
National Category
Computer Systems Computer Sciences Computer Engineering Software Engineering
Identifiers
URN: urn:nbn:se:hj:diva-65556OAI: oai:DiVA.org:hj-65556DiVA, id: diva2:1882108
External cooperation
Verendus System AB
Subject / course
JTH, Computer Engineering
Presentation
2024-05-28, E1029, JTH, 553 18 Jönköping, Jönköping, 11:00 (English)
Supervisors
Examiners
Available from: 2024-07-22 Created: 2024-07-04 Last updated: 2025-10-13Bibliographically approved

Open Access in DiVA

fulltext(6052 kB)708 downloads
File information
File name FULLTEXT01.pdfFile size 6052 kBChecksum SHA-512
a1c0616ed71b4ac182515c34cc5b36bb118246b98de941136ff6ee8cabde062c5b84ef37ba0b714a15568e679a9d0762670bfbd8d74839a739999d9f1bc6a298
Type fulltextMimetype application/pdf

By organisation
JTH, Department of Computer Science and InformaticsJTH, Department of Computing
Computer SystemsComputer SciencesComputer EngineeringSoftware Engineering

Search outside of DiVA

GoogleGoogle Scholar
Total: 710 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 1237 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf