UNiON Scholar
UNiON Web Scholar iON AI About Scholar
334 scholarly results for stat.AP
Scholar iON Academic Synthesis
This collection of scholarly papers explores statistical approaches and methodologies in applied contexts, highlighting significant advancements in biological and ecological data analysis. Li et al. (2004) and Fury et al. (2006) both focus on gene selection criteria in microarray data analysis, emphasizing the computational efficiency of extreme value distributions and the probabilistic overlap of gene lists, respectively. Mozaffarilegha and Movahed (2019) delve into the multifractal nature of auditory brainstem responses, identifying long-range temporal correlations that suggest complex underlying neural mechanisms. Vilk et al. (2024) investigate animal movement patterns through anomalous diffusion models, revealing age-specific ecological behaviors. Collectively, these studies underscore the importance of sophisticated statistical tools in uncovering intricate patterns within biological datasets, facilitating deeper insights into genetic, neurological, and ecological phenomena.
πŸŽ“ Deep dive with Scholar iON β†’
arxiv.org Β· scholarly article
Extreme Value Distribution Based Gene Selection Criteria for Discriminant Microarray Data Analysis Using Logistic Regression
Wentian Li; Fengzhu Sun; Ivo Grosse
2004 arXiv Open Access DOI: 10.1089/1066527041410445
One important issue commonly encountered in the analysis of microarray data is to decide which and how many genes should be selected for further studies. For discriminant microarray data analyses based on statistical models, such as the logistic regression models, gene selection can be accomplished by a comparison of the maximum likelihood of the model given the real data, $\hat{L}(D|M)$, and the expected maximum likelihood of the model given an ensemble of surrogate data with randomly permuted label, $\hat{L}(D_0|M)$. Typically, the computational burden for obtaining $\hat{L}(D_0|M)$ is immense, often exceeding the limits of computing available resources by orders of magnitude. Here, we propose an approach that circumvents such heavy computations by mapping the simulation problem to an extreme-value problem. We present the derivation of an asymptotic distribution of the extreme-value as well as its mean, median, and variance. Using this distribution, we propose two gene selection criteria, and we apply them to two microarray datasets and three classification tasks for illustration.
arxiv.org Β· scholarly article
Long-range temporal correlation in Auditory Brainstem Responses to Spoken Syllable /da/
Marjan Mozaffarilegha; S. M. S. Movahed
2019 arXiv Open Access DOI: 10.1038/s41598-018-38215-w
The speech auditory brainstem response (sABR) is an objective clinical tool to diagnose particular impairments along the auditory brainstem pathways. We explore the scaling behavior of the brainstem in response to synthetic /da/ stimuli using a proposed pipeline including Multifractal Detrended Moving Average Analysis (MFDMA) modified by Singular Value Decomposition. The scaling exponent confirms that all normal sABR are classified into the non-stationary process. The average Hurst exponent is $H=0.77\pm0.12$ at 68\% confidence interval indicating long-range correlation which shows the first universality behavior of sABR. Our findings exhibit that fluctuations in the sABR series are dictated by a mechanism associated with long-term memory of the dynamic of the auditory system in the brainstem level. The $q-$dependency of $h(q)$ demonstrates that underlying data sets have multifractal nature revealing the second universality behavior of the normal sABR samples. Comparing Hurst exponent of original sABR with the results of the corresponding shuffled and surrogate series, we conclude that its multifractality is almost due to the long-range temporal correlations which are devoted to the third universality. Finally, the presence of long-range correlation which is related to the slow timescales in the subcortical level and integration of information in the brainstem network is confirmed.
arxiv.org Β· scholarly article
Strong anomalous diffusion for free-ranging birds
Ohad Vilk; Motti Charter; Sivan Toledo; Eli Barkai; Ran Nathan
2024 arXiv Open Access
Diffusion and anomalous diffusion are widely observed and used to study movement across organisms, resulting in extensive use of the mean and mean-squared displacement (MSD). However, these measures - corresponding to specific displacement moments - do not capture the full complexity of movement behavior. Using high-resolution data from over 70 million localizations of young and adult free-ranging Barn Owls (\textit{Tyto alba}), we reveal strong anomalous diffusion as nonlinear growth of displacement moments. The moment spectrum function $Ξ»_t(q)$ -- defined by $\left<|\bm{x}(t)|^q\right> \sim t^{Ξ»_t(q)}$ -- displays piecewise linearity in $q$, with a critical moment marking the crossover between scaling regimes. This highlights the need of a broad spectrum of displacement moments to characterize movement, which we link to age-specific ecological drivers. Furthermore, a characteristic timescale of five minutes marks an unexpected transition from a convex to a concave $Ξ»_t(q)$. Using two stochastic models - a bounded LΓ©vy walk and a multi-mode behavioral model - we account for the observed phenomena, showing good agreement with data, relating age-specific behavioral states to environmentally confined movement, and demonstrating how LΓ©vy walk-like patterns can arise from underlying behavioral structure. Finally, we discuss the ecological significance of our results, arguing that strong anomalous diffusion may be widespread in animal movement.
arxiv.org Β· scholarly article
Overlapping Probabilities of Top Ranking Gene Lists, Hypergeometric Distribution, and Stringency of Gene Selection Criterion
Wen Fury; Franak Batliwalla; Peter K. Gregersen; Wentian Li
2006 arXiv Open Access DOI: 10.1109/IEMBS.2006.260828
When the same set of genes appear in two top ranking gene lists in two different studies, it is often of interest to estimate the probability for this being a chance event. This overlapping probability is well known to follow the hypergeometric distribution. Usually, the lengths of top-ranking gene lists are assumed to be fixed, by using a pre-set criterion on, e.g., $p$-value for the t-test. We investigate how overlapping probability changes with the gene selection criterion, or simply, with the length of the top-ranking gene lists. It is concluded that overlapping probability is indeed a function of the gene list length, and its statistical significance should be quoted in the context of gene selection criterion.
arxiv.org Β· scholarly article
The Sustainability Gap in Robotics: A Large-Scale Survey of Sustainability Awareness in 50,000 Research Articles
Antun Skuric; Leandro Von Werra; Thomas Wolf
2026 arXiv Open Access
We present a large-scale survey of sustainability communication and motivation in robotics research. Our analysis covers nearly 50,000 open-access papers from arXiv's cs.RO category published between 2015 and early 2026. In this study, we quantify how often papers mention social, ecological, and sustainability impacts, and we analyse their alignment with the UN Sustainable Development Goals (SDGs). The results reveal a persistent gap between the field's potential and its stated intent. While a large fraction of robotics papers can be mapped to SDG-relevant domains, explicit sustainability motivation remains remarkably low. Specifically, mentions of sustainability-related impacts are typically below 2%, explicit SDG references stay below 0.1%, and the proportion of sustainability-motivated papers remains below 5%. These trends suggest that while the field of robotics is advancing rapidly, sustainability is not yet a standard part of research framing. We conclude by proposing concrete actions for researchers, conferences, and institutions to close these awareness and motivation gaps, supporting a shift toward more intentional and responsible innovation.
arxiv.org Β· scholarly article
Why Autonomous Vehicles Are Not Ready Yet: A Multi-Disciplinary Review of Problems, Attempted Solutions, and Future Directions
Xingshuai Dong; Max Cappuccio; Hamad Al Jassmi; Fady Alnajjar; Essam Debie; Milad Ghasrikhouzani; Alessandro Lanteri; Ali Luqman; Tate McGregor; Oleksandra Molloy; Alice Plebe; Michael Regan; Dongmo Zhang
2023 arXiv Open Access
Personal autonomous vehicles are cars, trucks and bikes capable of sensing their surrounding environment, planning their route, and driving with little or no involvement of human drivers. Despite the impressive technological achievements made by the industry in recent times and the hopeful announcements made by leading entrepreneurs, to date no personal vehicle is approved for road circulation in a 'fully' or 'semi' autonomous mode (autonomy levels 4 and 5) and it is still unclear when such vehicles will eventually be mature enough to receive this kind of approval. The present review adopts an integrative and multidisciplinary approach to investigate the major challenges faced by the automative sector, with the aim to identify the problems that still trouble and delay the commercialization of autonomous vehicles. The review examines the limitations and risks associated with current technologies and the most promising solutions devised by the researchers. This negative assessment methodology is not motivated by pessimism, but by the aspiration to raise critical awareness about the technology's state-of-the-art, the industry's quality standards, and the society's demands and expectations. While the survey primarily focuses on the applications of artificial intelligence for perception and navigation, it also aims to offer an enlarged picture that links the purely technological aspects with the relevant human-centric aspects, including, cultural attitudes, conceptual assumptions, and normative (ethico-legal) frameworks. Examining the broader context serves to highlight problems that have a cross-disciplinary scope and identify solutions that may benefit from a holistic consideration.
arxiv.org Β· scholarly article
Identifying the Development and Application of Artificial Intelligence in Scientific Text
James Dunham; Jennifer Melot; Dewey Murdick
2020 arXiv Open Access
We describe a strategy for identifying the universe of research publications relevant to the application and development of artificial intelligence. The approach leverages the arXiv corpus of scientific preprints, in which authors choose subject tags for their papers from a set defined by editors. We compose a functional definition of AI relevance by learning these subjects from paper metadata, and then inferring the arXiv-subject labels of papers in larger corpora: Clarivate Web of Science, Digital Science Dimensions, and Microsoft Academic Graph. This yields predictive classification $F_1$ scores between .75 and .86 for Natural Language Processing (cs.CL), Computer Vision (cs.CV), and Robotics (cs.RO). For a single model that learns these and four other AI-relevant subjects (cs.AI, cs.LG, stat.ML, and cs.MA), we see precision of .83 and recall of .85. We evaluate the out-of-domain performance of our classifiers against other sources of topic information and predictions from alternative methods. We find that a supervised solution can generalize to identify publications that belong to the high-level fields of study represented on arXiv. This offers a method for identifying AI-relevant publications that updates at the pace of research output, without reliance on subject-matter experts for query development or labeling.
arxiv.org Β· scholarly article
Autonomous motion in changing environment, fibrations and reaction mechanisms
Michael Farber; Stefan Kurz; Mathias Pillin
2025 arXiv Open Access
In this paper we develop further the formalism of fibrations of configuration spaces as a tool for modelling motion of autonomous systems in variable environments. We analyse the situations when the external conditions may change during the motion of the system and analyse two possibilities: (a) when the behaviour of the external conditions is known in advance; and (b) when the future changes of the external conditions are unknown but we can measure the current state and the current velocity of the external conditions, at every moment of time. We prove that in the case (a) the complexity of the motion algorithm is the same as in the case of constant external conditions; this generalises the result of \cite{FGY}. In case (b) we introduce a new concept of a reaction mechanism which allows to take into account unexpected and unpredictable changes in the environment. A reaction mechanism is mathematically an infinitesimal lifting function on a fibre bundle, a nonlinear generalisation of the classical concept of an Ehresmann connection. We illustrate these notions by examples which show that nonlinear infinitesimal lifting function (reaction mechanisms) appear naturally, are inevitable and ubiquitous.
arxiv.org Β· scholarly article
FactoryBench: Evaluating Industrial Machine Understanding
Yanis Merzouki; Coral Izquierdo; Matei Ignuta-Ciuncanu; Marcos Gomez-Bracamonte; Riccardo Maggioni; Alessandro Lombardi; Camilla Mazzoleni; Federico Martelli; Balazs Gunther; Jonas Petersen; Philipp Petersen
2026 arXiv Open Access
We introduce FactoryBench, a benchmark for evaluating time-series models and LLMs on machine understanding over industrial robotic telemetry. Q&A pairs are organized along four causal levels (state, intervention, counterfactual, decision) instantiating Pearl's ladder of causation, and span five answer formats: four structured formats are scored deterministically and free-form answers are scored by an LLM-as-judge voting protocol. We propose a scalable Q&A generation framework built around structured question templates, present FactoryWave (a dense, multitask, multivariate sensor dataset collected from a UR3 cobot and a KUKA KR10 industrial arm), and construct FactoryBench as a large-scale benchmark of over 70k Q&A items grounded in roughly 15k normalized episodes from FactoryWave, AURSAD, and voraus-AD. Zero-shot evaluation of six frontier LLMs shows that no model exceeds 50% on structured levels or 18% on decision-making, revealing a wide gap between current models and operational machine understanding.
arxiv.org Β· scholarly article
RAUNet: Residual Attention U-Net for Semantic Segmentation of Cataract Surgical Instruments
Zhen-Liang Ni; Gui-Bin Bian; Xiao-Hu Zhou; Zeng-Guang Hou; Xiao-Liang Xie; Chen Wang; Yan-Jie Zhou; Rui-Qi Li; Zhen Li
2019 arXiv Open Access
Semantic segmentation of surgical instruments plays a crucial role in robot-assisted surgery. However, accurate segmentation of cataract surgical instruments is still a challenge due to specular reflection and class imbalance issues. In this paper, an attention-guided network is proposed to segment the cataract surgical instrument. A new attention module is designed to learn discriminative features and address the specular reflection issue. It captures global context and encodes semantic dependencies to emphasize key semantic features, boosting the feature representation. This attention module has very few parameters, which helps to save memory. Thus, it can be flexibly plugged into other networks. Besides, a hybrid loss is introduced to train our network for addressing the class imbalance issue, which merges cross entropy and logarithms of Dice loss. A new dataset named Cata7 is constructed to evaluate our network. To the best of our knowledge, this is the first cataract surgical instrument dataset for semantic segmentation. Based on this dataset, RAUNet achieves state-of-the-art performance 97.71% mean Dice and 95.62% mean IOU.