UNiON Scholar: machine learning

arxiv.org · scholarly article

Reproducibility in Machine Learning for Health

Matthew B. A. McDermott; Shirly Wang; Nikki Marinsek; Rajesh Ranganath; Marzyeh Ghassemi; Luca Foschini

2019 arXiv Open Access

Machine learning algorithms designed to characterize, monitor, and intervene on human health (ML4H) are expected to perform safely and reliably when operating at scale, potentially outside strict human supervision. This requirement warrants a stricter attention to issues of reproducibility than other fields of machine learning. In this work, we conduct a systematic evaluation of over 100 recently published ML4H research papers along several dimensions related to reproducibility. We find that the field of ML4H compares poorly to more established machine learning fields, particularly concerning data and code accessibility. Finally, drawing from success in other fields of science, we propose recommendations to data providers, academic publishers, and the ML4H research community in order to promote reproducible research moving forward.

View Paper ↗ 📄 PDF 🎓 Explain with iON

arxiv.org · scholarly article

Active learning for data streams: a survey

Davide Cacciarelli; Murat Kulahci

2023 arXiv Open Access DOI: 10.1007/s10994-023-06454-2

Online active learning is a paradigm in machine learning that aims to select the most informative data points to label from a data stream. The problem of minimizing the cost associated with collecting labeled observations has gained a lot of attention in recent years, particularly in real-world applications where data is only available in an unlabeled form. Annotating each observation can be time-consuming and costly, making it difficult to obtain large amounts of labeled data. To overcome this issue, many active learning strategies have been proposed in the last decades, aiming to select the most informative observations for labeling in order to improve the performance of machine learning models. These approaches can be broadly divided into two categories: static pool-based and stream-based active learning. Pool-based active learning involves selecting a subset of observations from a closed pool of unlabeled data, and it has been the focus of many surveys and literature reviews. However, the growing availability of data streams has led to an increase in the number of approaches that focus on online active learning, which involves continuously selecting and labeling observations as they arrive in a stream. This work aims to provide an overview of the most recently proposed approaches for selecting the most informative observations from data streams in real time. We review the various techniques that have been proposed and discuss their strengths and limitations, as well as the challenges and opportunities that exist in this area of research.

View Paper ↗ 📄 PDF DOI 🎓 Explain with iON

arxiv.org · scholarly article

A Benchmark Study of Machine Learning Models for Online Fake News Detection

Junaed Younus Khan; Md. Tawkat Islam Khondaker; Sadia Afroz; Gias Uddin; Anindya Iqbal

2019 arXiv Open Access DOI: 10.1016/j.mlwa.2021.100032

The proliferation of fake news and its propagation on social media has become a major concern due to its ability to create devastating impacts. Different machine learning approaches have been suggested to detect fake news. However, most of those focused on a specific type of news (such as political) which leads us to the question of dataset-bias of the models used. In this research, we conducted a benchmark study to assess the performance of different applicable machine learning approaches on three different datasets where we accumulated the largest and most diversified one. We explored a number of advanced pre-trained language models for fake news detection along with the traditional and deep learning ones and compared their performances from different aspects for the first time to the best of our knowledge. We find that BERT and similar pre-trained models perform the best for fake news detection, especially with very small dataset. Hence, these models are significantly better option for languages with limited electronic contents, i.e., training data. We also carried out several analysis based on the models' performance, article's topic, article's length, and discussed different lessons learned from them. We believe that this benchmark study will help the research community to explore further and news sites/blogs to select the most appropriate fake news detection method.

View Paper ↗ 📄 PDF DOI 🎓 Explain with iON

arxiv.org · scholarly article

Privacy-preserving machine learning for healthcare: open challenges and future perspectives

Alejandro Guerra-Manzanares; L. Julian Lechuga Lopez; Michail Maniatakos; Farah E. Shamout

2023 arXiv Open Access DOI: 10.1007/978-3-031-39539-0_3

Machine Learning (ML) has recently shown tremendous success in modeling various healthcare prediction tasks, ranging from disease diagnosis and prognosis to patient treatment. Due to the sensitive nature of medical data, privacy must be considered along the entire ML pipeline, from model training to inference. In this paper, we conduct a review of recent literature concerning Privacy-Preserving Machine Learning (PPML) for healthcare. We primarily focus on privacy-preserving training and inference-as-a-service, and perform a comprehensive review of existing trends, identify challenges, and discuss opportunities for future research directions. The aim of this review is to guide the development of private and efficient ML models in healthcare, with the prospects of translating research efforts into real-world settings.

View Paper ↗ 📄 PDF DOI 🎓 Explain with iON

arxiv.org · scholarly article

Soil Property and Class Maps of the Conterminous US at 100 meter Spatial Resolution based on a Compilation of National Soil Point Observations and Machine Learning

Amanda Ramcharan; Tomislav Hengl; Travis Nauman; Colby Brungard; Sharon Waltman; Skye Wills; James Thompson

2017 arXiv Open Access DOI: 10.2136/sssaj2017.04.0122

With growing concern for the depletion of soil resources, conventional soil data must be updated to support spatially explicit human-landscape models. Three US soil point datasetswere combined with a stack of over 200 environmental datasets to generate complete coverage gridded predictions at 100 m spatial resolution of soil properties (percent organic C, total N, bulk density, pH, and percent sand and clay) and US soil taxonomic classes (291 great groups and 78 modified particle size classes) for the conterminous US. Models were built using parallelized random forest and gradient boosting algorithms. Soil property predictions were generated at seven standard soil depths (0, 5, 15, 30, 60, 100 and 200 cm). Prediction probability maps for US soil taxonomic classifications were also generated. Model validation results indicate an out-of-bag classification accuracy of 60 percent for great groups, and 66 percent for modified particle size classes; for soil properties cross-validated R-square ranged from 62 percent for total N to 87 percent for pH. Nine independent validation datasets were used to assess prediction accuracies for soil class models and results ranged between 24-58 percent and 24-93 percent for great group and modified particle size class prediction accuracies, respectively. The hybrid "SoilGrids+" modeling system that incorporates remote sensing data, local predictions of soil properties, conventional soil polygon maps, and machine learning opens the possibility for updating conventional soil survey data with machine learning technology to make soil information easier to integrate with spatially explicit models, compared to multi-component map units.

View Paper ↗ 📄 PDF DOI 🎓 Explain with iON

arxiv.org · scholarly article

Fourier Learning Machines: Nonharmonic Fourier-Based Neural Networks for Scientific Machine Learning

Mominul Rubel; Adam Meyers; Gabriel Nicolosi

2025 arXiv Open Access

We introduce the Fourier Learning Machine (FLM), a neural network (NN) architecture designed to represent a multidimensional nonharmonic Fourier series. The FLM uses a simple feedforward structure with cosine activation functions to learn the frequencies, amplitudes, and phase shifts of the series as trainable parameters. This design allows the model to create a problem-specific spectral basis adaptable to both periodic and nonperiodic functions. Unlike previous Fourier-inspired NN models, the FLM is the first architecture able to represent a multidimensional Fourier series with a complete set of basis functions in separable form, doing so by using a standard Multilayer Perceptron-like architecture. A one-to-one correspondence between the Fourier coefficients and amplitudes and phase-shifts is demonstrated, allowing for the translation between a full, separable basis form and the cosine phase-shifted one. Additionally, we evaluate the performance of FLMs on several scientific computing problems, including benchmark Partial Differential Equations (PDEs) and a family of Optimal Control Problems (OCPs). Computational experiments show that the performance of FLMs is comparable, and often superior, to that of established architectures like SIREN and vanilla feedforward NNs.

View Paper ↗ 📄 PDF 🎓 Explain with iON

arxiv.org · scholarly article

On the Origin of Species of Self-Supervised Learning

Samuel Albanie; Erika Lu; Joao F. Henriques

2021 arXiv Open Access

In the quiet backwaters of cs.CV, cs.LG and stat.ML, a cornucopia of new learning systems is emerging from a primordial soup of mathematics-learning systems with no need for external supervision. To date, little thought has been given to how these self-supervised learners have sprung into being or the principles that govern their continuing diversification. After a period of deliberate study and dispassionate judgement during which each author set their Zoom virtual background to a separate Galapagos island, we now entertain no doubt that each of these learning machines are lineal descendants of some older and generally extinct species. We make five contributions: (1) We gather and catalogue row-major arrays of machine learning specimens, each exhibiting heritable discriminative features; (2) We document a mutation mechanism by which almost imperceptible changes are introduced to the genotype of new systems, but their phenotype (birdsong in the form of tweets and vestigial plumage such as press releases) communicates dramatic changes; (3) We propose a unifying theory of self-supervised machine evolution and compare to other unifying theories on standard unifying theory benchmarks, where we establish a new (and unifying) state of the art; (4) We discuss the importance of digital biodiversity, in light of the endearingly optimistic Paris Agreement.

View Paper ↗ 📄 PDF 🎓 Explain with iON

arxiv.org · scholarly article

Watershed of Artificial Intelligence: Human Intelligence, Machine Intelligence, and Biological Intelligence

Li Weigang; Liriam Enamoto; Denise Leyi Li; Geraldo Pereira Rocha Filho

2021 arXiv Open Access

This article reviews the "Once learning" mechanism that was proposed 23 years ago and the subsequent successes of "One-shot learning" in image classification and "You Only Look Once - YOLO" in objective detection. Analyzing the current development of Artificial Intelligence (AI), the proposal is that AI should be clearly divided into the following categories: Artificial Human Intelligence (AHI), Artificial Machine Intelligence (AMI), and Artificial Biological Intelligence (ABI), which will also be the main directions of theory and application development for AI. As a watershed for the branches of AI, some classification standards and methods are discussed: 1) Human-oriented, machine-oriented, and biological-oriented AI R&D; 2) Information input processed by Dimensionality-up or Dimensionality-reduction; 3) The use of one/few or large samples for knowledge learning.

View Paper ↗ 📄 PDF 🎓 Explain with iON

arxiv.org · scholarly article

Quantitative Trading using Deep Q Learning

Soumyadip Sarkar

2023 arXiv Open Access DOI: 10.22214/ijraset.2023.50170

Reinforcement learning (RL) is a subfield of machine learning that has been used in many fields, such as robotics, gaming, and autonomous systems. There has been growing interest in using RL for quantitative trading, where the goal is to make trades that generate profits in financial markets. This paper presents the use of RL for quantitative trading and reports a case study based on an RL-based trading algorithm. The results show that RL can be a useful tool for quantitative trading and can perform better than traditional trading algorithms. The use of reinforcement learning for quantitative trading is a promising area of research that can help develop more sophisticated and efficient trading systems. Future research can explore the use of other reinforcement learning techniques, the use of other data sources, and the testing of the system on a range of asset classes. Together, our work shows the potential in the use of reinforcement learning for quantitative trading and the need for further research and development in this area. By developing the sophistication and efficiency of trading systems, it may be possible to make financial markets more efficient and generate higher returns for investors.

View Paper ↗ 📄 PDF DOI 🎓 Explain with iON

semanticscholar.org · scholarly article

On the Origin of Species of Self-Supervised Learning

Samuel Albanie; Erika Lu; João F. Henriques

2021 arXiv.org 📖 Cited 1 times

In the quiet backwaters of cs.CV, cs.LG and stat.ML, a cornucopia of new learning systems is emerging from a primordial soup of mathematics-learning systems with no need for external supervision. To date, little thought has been given to how these self-supervised learners have sprung into being or the principles that govern their continuing diversification. After a period of deliberate study and dispassionate judgement during which each author set their Zoom virtual background to a separate Galapagos island, we now entertain no doubt that each of these learning machines are lineal descendants of some older and generally extinct species. We make five contributions: (1) We gather and catalogue row-major arrays of machine learning specimens, each exhibiting heritable discriminative features; (2) We document a mutation mechanism by which almost imperceptible changes are introduced to the genotype of new systems, but their phenotype (birdsong in the form of tweets and vestigial plumage such as press releases) communicates dramatic changes; (3) We propose a unifying theory of self-supervised machine evolution and compare to other unifying theories on standard unifying theory benchmarks, where we establish a new (and unifying) state of the art; (4) We discuss the importance of digital biodiversity, in light of the endearingly optimistic Paris Agreement.

View Paper ↗ 🎓 Explain with iON