Lupa

Show document Help

A- | A+ | Print
Title:Is open source the future of AI? : a data-driven approach
Authors:ID Vake, Domen (Author)
ID Šinik, Bogdan (Author)
ID Vičič, Jernej (Author)
ID Tošić, Aleksandar (Author)
Files:.pdf RAZ_Vake_Domen_2025.pdf (606,12 KB)
MD5: 92A7906B18A94EF040FF44A7D6A66C78
 
URL https://www.mdpi.com/2076-3417/15/5/2790
 
Language:English
Work type:Article
Typology:1.01 - Original Scientific Article
Organization:FAMNIT - Faculty of Mathematics, Science and Information Technologies
Abstract:Large language models (LLMs) have become central to both academic research and industrial applications, fueling debates on their accuracy, usability, privacy, and potential misuse. While proprietary models benefit from substantial investments in data and computing resources, open-sourcing is often suggested as a means to enhance trust and transparency. Yet, open-sourcing comes with its own challenges, such as risks of illicit applications, limited financial incentives, and intellectual property concerns. Positioned between these extremes are hybrid approaches—including partially open models and licensing restrictions—that aim to balance openness with control. In this paper, we adopt a data-driven approach to examine the open-source development of LLMs. By analyzing contributions in model improvements, modifications, and methodologies, we assess how community efforts impact model performance. Our findings indicate that the open-source community can significantly enhance models, demonstrating that community-driven modifications can yield efficiency gains without compromising performance. Moreover, our analysis reveals distinct trends in community growth and highlights which architectures benefit disproportionately from open-source engagement. These insights provide an empirical foundation to inform balanced discussions among industry experts and policymakers on the future direction of AI development.
Keywords:large language models, artificial intelligence, open source, data science, HuggingFace
Publication version:Version of Record
Publication date:05.03.2025
Year of publishing:2025
Number of pages:str. 1-18
Numbering:Vol. 15, iss. 5, [article no.] 2790
PID:20.500.12556/RUP-21786 This link opens in a new window
UDC:004.8
ISSN on article:2076-3417
DOI:10.3390/app15052790 This link opens in a new window
COBISS.SI-ID:228028675 This link opens in a new window
Publication date in RUP:25.09.2025
Views:514
Downloads:4
Metadata:XML DC-XML DC-RDF
:
Copy citation
  
Average score:(0 votes)
Your score:Voting is allowed only for logged in users.
Share:Bookmark and Share


Hover the mouse pointer over a document title to show the abstract or click on the title to get all document metadata.

Record is a part of a journal

Title:Applied sciences
Shortened title:Appl. sci.
Publisher:MDPI
ISSN:2076-3417
COBISS.SI-ID:522979353 This link opens in a new window

Document is financed by a project

Funder:EC - European Commission
Project number:101135012
Name:Application-level Swarm-based Orchestration Across the Cloud-to-Edge Continuum
Acronym:Swarmchestrate

Funder:Other - Other funder or multiple funders
Project number:-
Name:BLOCKCHAIN BASED VERIFIABLE COMPUTATION
Acronym:BBVC

Licences

License:CC BY 4.0, Creative Commons Attribution 4.0 International
Link:http://creativecommons.org/licenses/by/4.0/
Description:This is the standard Creative Commons license that gives others maximum freedom to do what they want with the work as long as they credit the author.

Secondary language

Language:Slovenian
Title:Is open source the future of AI? A data-driven approach
Keywords:veliki jezikovni modeli, umetna inteligenca, odprta koda, podatkovna znanost, HuggingFace


Comments

Leave comment

You must log in to leave a comment.

Comments (0)
0 - 0 / 0
 
There are no comments!

Back
Logos of partners University of Maribor University of Ljubljana University of Primorska University of Nova Gorica