Search
Search Results
-
MnTTS2: An Open-Source Multi-speaker Mongolian Text-to-Speech Synthesis Dataset
Text-to-Speech (TTS) synthesis for low-resource languages is an attractive research issue in academia and industry nowadays. Mongolian is the... -
Who makes open source code? The hybridisation of commercial and open source practices
While Free and Open Source (F/OSS) coding has traditionally been described as a separate commons linked to values of openness and sharing, recent...
-
MOPRD: A multidisciplinary open peer review dataset
Open peer review is a growing trend in academic publications. Public access to peer review data can benefit both the academic and publishing...
-
Analyzing source code vulnerabilities in the D2A dataset with ML ensembles and C-BERT
Static analysis tools are widely used for vulnerability detection as they can analyze programs with complex behavior and millions of lines of code....
-
Challenges as catalysts: how Waymo’s Open Dataset Challenges shape AI development
Artificial intelligence (AI) and machine learning (ML) are becoming increasingly significant areas of research for scholars in science and technology...
-
Pairwise open-sourced dataSet protection based on adaptive blind watermarking
The cost of collecting and labeling open-sourced datasets which promote the development of deep learning is expensive. Thus, it is important to...
-
ProvSec: Open Cybersecurity System Provenance Analysis Benchmark Dataset with Labels
System provenance forensic analysis has been studied by a large body of research work. This area needs fine granularity data such as system calls...
-
GeoImageNet: a multi-source natural feature benchmark dataset for GeoAI and supervised machine learning
The field of GeoAI or Geospatial Artificial Intelligence has undergone rapid development since 2017. It has been widely applied to address...
-
Towards Measuring Vulnerabilities and Exposures in Open-Source Packages
Much of the current software depends on open-source components, which in turn have complex dependencies on other open-source libraries.... -
Open dataset discovery using context-enhanced similarity search
Today, open data catalogs enable users to search for datasets with full-text queries in metadata records combined with simple faceted filtering....
-
Comparative Analysis for Open-Source Large Language Models
Large Language Models (LLMs) have significantly advanced the field of Natural Language Processing (NLP), demonstrating exceptional performance across... -
Risk Assessment of Using Open Source Projects: Analysis of the Existing Approaches
AbstractThis article analyzes the existing approaches to assess and account for the components used in software, including open source software. The...
-
A novel image dataset for source camera identification and image based recognition systems
Multimodal emotion recognition has attracted a great deal of attention in recent years, with new interesting applications now being considered. One...
-
What can we learn from quality assurance badges in open-source software?
In the development of open-source software (OSS), many developers use badges to give an overview of the software and share some key features/metrics...
-
The software heritage license dataset (2022 edition)
Context:When software is released publicly, it is common to include with it either the full text of the license or licenses under which it is...
-
Comparative analysis of real issues in open-source machine learning projects
ContextIn the last decade of data-driven decision-making, Machine Learning (ML) systems reign supreme. Because of the different characteristics...
-
DIVOTrack: A Novel Dataset and Baseline Method for Cross-View Multi-Object Tracking in DIVerse Open Scenes
Cross-view multi-object tracking aims to link objects between frames and camera views with substantial overlaps. Although cross-view multi-object...
-
Open Source – GIS
This chapter provides an overview over the current spectrum of free and open source licensed geospatial software tools and communities. The number of...