Please use this identifier to cite or link to this item:
http://hdl.handle.net/20.500.12323/7677
Title: | A comparative quality assessment of ChatGPT-4 and human translation of scientific texts |
Other Titles: | ChatGPT-4 və Insan tərəfindən elmi mətnlərin tərcüməsinin müqayisəli keyfiyyət qiymətləndirilməsi |
Authors: | Ahmadova, Sabina |
Keywords: | AI (Artificial Intelligence); neural networks; deep learning; natural language processing (NLP); large language model (LLM); COMET-22; BLEURT-20; metrics; ChatGPT-4; ChatGPT-3.5; machine translation; human translation; scientific texts |
Issue Date: | 2024 |
Series/Report no.: | Master thesis |
Abstract: | It may be hard to believe, but we are all already surrounded by technology and closely connected with AI (Artificial Intelligence). For instance, we reach a destination using the voice-guided navigation of a car’s GPS (Global Positioning System), or ask the voice assistants Alice or Siri to dial a number from our contact list. In various fields, we try to create advanced automated systems and applications to reduce our workload. We have created robotic vacuum cleaners, washing machines, and “smart home” systems that can be controlled remotely by phone or PC, and we keep moving forward, working on 3D-printed animal and human prosthetics to improve the lives of people with disabilities. Undoubtedly, the field of translation is also undergoing significant change, and we are all witnesses to inventions that people have awaited and worked toward for several decades. In 2022, the world welcomed the large language model ChatGPT-3.5 and, one year later, ChatGPT-4, both introduced and intensively developed by OpenAI. These Generative Pretrained Transformer (GPT) language models, trained on a huge dataset drawn from the web, can generate text; the latest models translate between many languages, solve math problems, answer questions, create images, and can simply be a good interlocutor in a dialogue. This paper examines and compares translations of scientific texts from English into Russian produced by ChatGPT-3.5, ChatGPT-4, and human translators. The study conducts a comparative quality analysis of these translations using the neural machine translation evaluation metrics COMET-22 and BLEURT-20. A key goal is to identify errors and shed light on the strengths and limitations, primarily of the ChatGPT-4 model. Additionally, the paper identifies improvements in the capabilities of ChatGPT-4 relative to its predecessor ChatGPT-3.5, and assesses the performance of both in comparison with human translation.
Using mixed-method empirical research, this study analyzed 30 scientific articles to assess the efficiency and capability of GPT models. |
Description: | Faculty: Graduate School of Science, Arts and Technology; Department: English Language and Literature; Speciality: Translation; Supervisor: Prof. Dr. Huseynagha Rzayev |
URI: | http://hdl.handle.net/20.500.12323/7677 |
Appears in Collections: | Thesis |
Files in This Item:
File | Description | Size | Format |
---|---|---|---|
A comparative quality assessment of ChatGPT-4 and human translation of scientific texts.pdf | | 1.34 MB | Adobe PDF |