Scholar iON
Academic Synthesis
The collected scholarly works reflect significant advancements in computational methods for language processing, persona generation, lie detection in language models, and infectious disease modeling. Steinberger's study on cross-lingual keyword assignment leverages a multilingual thesaurus for effective cross-language document comparison, highlighting the challenges of language-independent keyword assignment. Shigemura's research on noise-driven persona formation in neural language models emphasizes the impact of stochastic processes on emergent linguistic behaviors, providing a framework to study reflexive generation dynamics. Kretschmar et al.'s LIARS' BENCH introduces a comprehensive testbed for evaluating the reliability of lie detection techniques in LLMs, revealing current limitations and guiding future improvements. Meanwhile, Wang et al.'s extension of the SIR model incorporates temporal and spatial dynamics to enhance epidemic modeling, demonstrating the application of advanced system inference techniques to real-world data. Collectively, these studies underscore the importance of robust computational frameworks in understanding complex systems across diverse domains.
This paper presents a language-independent approach to controlled vocabulary keyword assignment using the EUROVOC thesaurus. Due to the multilingual nature of EUROVOC, the keywords for a document written in one language can be displayed in all eleven official European Union languages. The mapping of documents written in different languages to the same multilingual thesaurus furthermore allows cross-language document comparison. The assignment of the controlled vocabulary thesaurus descriptors is achieved by applying a statistical method that uses a collection of manually indexed documents to identify, for each thesaurus descriptor, a large number of lemmas that are statistically associated to the descriptor. These associated words are then used during the assignment procedure to identify a ranked list of those EUROVOC terms that are most likely to be good keywords for a given document. The paper also describes the challenges of this task and discusses the achieved results of the fully functional prototype.
This paper introduces the Luca-Noise Reflex Protocol (LN-RP), a computational framework for analyzing noise-driven persona emergence in large language models. By injecting stochastic noise seeds into the initial generation state, we observe nonlinear transitions in linguistic behavior across 152 generation cycles. Our results reveal three stable persona modes with distinct entropy signatures, and demonstrate that external noise sources can reliably induce phase transitions in reflexive generation dynamics. Quantitative evaluation confirms consistent persona retention and significant differences across modes (p < 0.01). The protocol provides a reproducible method for studying reflexive generation, emergent behavior, and longrange linguistic coherence in LLMs.
Prior work has introduced techniques for detecting when large language models (LLMs) lie, that is, generate statements they believe are false. However, these techniques are typically validated in narrow settings that do not capture the diverse lies LLMs can generate. We introduce LIARS' BENCH, a testbed consisting of 72,863 examples of lies and honest responses generated by four open-weight models across seven datasets. Our settings capture qualitatively different types of lies and vary along two dimensions: the model's reason for lying and the object of belief targeted by the lie. Evaluating three black- and white-box lie detection techniques on LIARS' BENCH, we find that existing techniques systematically fail to identify certain types of lies, especially in settings where it's not possible to determine whether the model lied from the transcript alone. Overall, LIARS' BENCH reveals limitations in prior techniques and provides a practical testbed for guiding progress in lie detection.
We extend the classical SIR model of infectious disease spread to account for time dependence in the parameters, which also include diffusivities. The temporal dependence accounts for the changing characteristics of testing, quarantine and treatment protocols, while diffusivity incorporates a mobile population. This model has been applied to data on the evolution of the COVID-19 pandemic in the US state of Michigan. For system inference, we use recent advances; specifically our framework for Variational System Identification (Wang et al., Comp. Meth. App. Mech. Eng., 356, 44-74, 2019; arXiv:2001.04816 [cs.CE]) as well as Bayesian machine learning methods.
This paper encompasses a super helical memory system's design, 'Boolean logic & image-logic' as a theoretical concept of an invention-model to 'store time-data' in terms of anticipating the best memory location ever for data/time. A waterfall effect is deemed to assist the process of potential-difference output-switch into diverse logic states in quantum dot computational methods via utilizing coiled carbon nanotubes (CCNTs) and carbon nanotube field effect transistors (CNFETs). A 'quantum confinement' is thus derived for a flow of particles in a categorized quantum well substrate with a normalized capacitance rectifying high B-field flux into electromagnetic induction. Multi-access of coherent sequences of 'qubit addressing' is gained in any magnitude as pre-defined for the orientation of array displacement. Briefly, Gaussian curvature of k<0 is debated in aim of specifying the 2D electron gas characteristics in scenarios where data is stored in short intervals versus long ones e.g. when k'>(k<0) for greater CCNT diameters, space-time continuum is folded by chance for the particle. This benefits from Maxwell-Lorentz theory in Minkowski's space-time viewpoint alike to crystal oscillators for precise data timing purposes and radar systems e.g., time varying self-clocking devices in diverse geographic locations. This application could also be optional for data depository versus extraction, in the best supercomputer system's locations, autonomously. For best performance in minimizing current limiting mechanisms including electromigration, a multilevel metallization and implant process forming elevated sources/drains for the circuit's staircase pyramidal construction, is discussed accordingly.
Bioinformatics is a new discipline that addresses the need to manage and interpret the data that in the past decade was massively generated by genomic research. This discipline represents the convergence of genomics, biotechnology and information technology, and encompasses analysis and interpretation of data, modeling of biological phenomena, and development of algorithms and statistics. This article presents an introduction to bioinformatics
Slicing is a program analysis technique originally developed for imperative languages. It facilitates understanding of data flow and debugging.
This paper discusses slicing of Constraint Logic Programs. Constraint Logic Programming (CLP) is an emerging software technology with a growing number of applications. Data flow in constraint programs is not explicit, and for this reason the concepts of slice and the slicing techniques of imperative languages are not directly applicable.
This paper formulates declarative notions of slice suitable for CLP. They provide a basis for defining slicing techniques (both dynamic and static) based on variable sharing. The techniques are further extended by using groundness information.
A prototype dynamic slicer of CLP programs implementing the presented ideas is briefly described together with the results of some slicing experiments.
The Presidential Election in Mexico of July 2012 has been the third time that PREP, Previous Electoral Results Program works. PREP gives voting outcomes based in electoral certificates of each polling station that arrive to capture centers. In previous ones, some statistical regularities had been observed, three of them were selected to make predictions and were published in \texttt{arXiv:1207.0078 [physics.soc-ph]}. Using the database made public in July 2012, two of the predictions were completely fulfilled, while, the third one was measured and confirmed using the database obtained upon request to the electoral authorities. The first two predictions confirmed by actual measures are: (ii) The Partido Revolucionario Institucional, PRI, is a sprinter and has a better performance in polling stations arriving late to capture centers during the process. (iii) Distribution of vote of this party is well described by a smooth function named a Daisy model. A Gamma distribution, but compatible with a Daisy model, fits the distribution as well. The third prediction confirms that {\it errare humanum est}, since the error distributions of all the self-consistency variables appeared as a central power law with lateral lobes as in 2000 and 2006 electoral processes. The three measured regularities appeared no matter the political environment.
From its inception at the beginning of the eighties, with milestone results and ideas such as quantum simulation, the no-cloning theorem, and quantum computers, quantum information has established itself over the next decades, being nowadays a fast-developing field at the interface between fundamental science and a variety of promising technologies. In this work we aim to offer a portrait of this dynamic field, analyzing the statistical properties of the network of collaborations among its researchers. Using the quant-ph section from the arXiv as our database, we draw several conclusions on its properties. In particular, we show that the quantum information network of collaborations displays the small-world property, is very aggregated and assortative, being also in line with Newman's findings as for the presence of hubs and the Lotka's law regarding the average number of publications per author.
We study a dynamics of the epidemiological infection spreading at different values of the risk factor $Ξ²$ (a control parameter) with the using of dynamic Monte Carlo approach (DMC). In our toy model, the infection transmits due to contacts of randomly moving individuals. We show that the behavior of recovereds critically depends on the $Ξ²$ value. For sub-critical values $Ξ²<Ξ²_{c}\sim 0.6$, the number of infected cases asymptotically converges to zero, such that for a moderate risk factor the infection may disappear with time. Our simulations shown that over time, the properties of such a system asymptotically become close to the critical transition in 2D percolation system. We also analyzed an extended system, which includes two additional parameters: the limits of taking on/off quarantine state. It is found that the early quarantine off does result in the irregular (with positive Lyapunov exponent) oscillatory dynamics of infection. If the lower limit of the quarantine off is small enough, the recovery dynamics acquirers a characteristic nonmonotonic shape with several damped peaks. The dynamics of infection spreading in case of the individuals with immunity is studied too.