| Title: | Is open source the future of AI? : a data-driven approach |
|---|
| Authors: | ID Vake, Domen (Author) ID Šinik, Bogdan (Author) ID Vičič, Jernej (Author) ID Tošić, Aleksandar (Author) |
| Files: | RAZ_Vake_Domen_2025.pdf (606,12 KB) MD5: 92A7906B18A94EF040FF44A7D6A66C78
https://www.mdpi.com/2076-3417/15/5/2790
|
|---|
| Language: | English |
|---|
| Work type: | Article |
|---|
| Typology: | 1.01 - Original Scientific Article |
|---|
| Organization: | FAMNIT - Faculty of Mathematics, Science and Information Technologies
|
|---|
| Abstract: | Large language models (LLMs) have become central to both academic research and industrial applications, fueling debates on their accuracy, usability, privacy, and potential misuse. While proprietary models benefit from substantial investments in data and computing resources, open-sourcing is often suggested as a means to enhance trust and transparency. Yet, open-sourcing comes with its own challenges, such as risks of illicit applications, limited financial incentives, and intellectual property concerns. Positioned between these extremes are hybrid approaches—including partially open models and licensing restrictions—that aim to balance openness with control. In this paper, we adopt a data-driven approach to examine the open-source development of LLMs. By analyzing contributions in model improvements, modifications, and methodologies, we assess how community efforts impact model performance. Our findings indicate that the open-source community can significantly enhance models, demonstrating that community-driven modifications can yield efficiency gains without compromising performance. Moreover, our analysis reveals distinct trends in community growth and highlights which architectures benefit disproportionately from open-source engagement. These insights provide an empirical foundation to inform balanced discussions among industry experts and policymakers on the future direction of AI development. |
|---|
| Keywords: | large language models, artificial intelligence, open source, data science, HuggingFace |
|---|
| Publication version: | Version of Record |
|---|
| Publication date: | 05.03.2025 |
|---|
| Year of publishing: | 2025 |
|---|
| Number of pages: | str. 1-18 |
|---|
| Numbering: | Vol. 15, iss. 5, [article no.] 2790 |
|---|
| PID: | 20.500.12556/RUP-21786  |
|---|
| UDC: | 004.8 |
|---|
| ISSN on article: | 2076-3417 |
|---|
| DOI: | 10.3390/app15052790  |
|---|
| COBISS.SI-ID: | 228028675  |
|---|
| Publication date in RUP: | 25.09.2025 |
|---|
| Views: | 514 |
|---|
| Downloads: | 4 |
|---|
| Metadata: |  |
|---|
|
:
|
Copy citation |
|---|
| | | | Average score: | (0 votes) |
|---|
| Your score: | Voting is allowed only for logged in users. |
|---|
| Share: |  |
|---|
Hover the mouse pointer over a document title to show the abstract or click
on the title to get all document metadata. |