Public Articles
MolSSI Education: Empowering the Next Generation of Computational Molecular Scientists.
and 3 collaborators
The Molecular Sciences Software Institute (MolSSI) is a research and education center that supports software development in the computational molecular sciences. One of MolSSI's core objectives is to provide education and training for the next generation of computational researchers. MolSSI Education targets various career stages and skill levels through its live workshops, online resources, and software fellowship program, focusing its efforts within four areas, including programming and software development, faculty and curriculum development, and the software fellowship program. This article delineates educational efforts at MolSSI, overall goals, and resources that can be useful to researchers in the computational molecular sciences.
How useful are lexicostatistical and phylogenetic methods in plotting the migration of Polynesian peoples across Oceania?
Genotypic variation rather than ploidy level determines functional trait expression in a foundation tree species in the presence and absence of environmental stress
and 5 collaborators
The First Paper
Our paper is motivated by the recent publication [98] in which the FDTDM was used to compute the LDR for smoke clusters of up to four monomers in order to analyze implications of depolarization lidar observations from the Cloud-Aerosol Lidar and Infrared Pathfinder Satellite Observation (CALIPSO) satellite [115]. We extend the analysis of Ref. [98] by considering a more comprehensive and representative set of soot-particle models and using what we believe to be more relevant refractive indices. \citep{Mishchenko_2013}
Right cardiac chambers echo-bubble contrast in a patient with decompression sickness: A case report and a literature review
and 2 collaborators
BIO 465 Capstone Project Introduction
and 1 collaborator
Lecture 12- Entanglement
and 2 collaborators
Robotic Pill for Biomarker and Fluid Sampling in the Gastrointestinal Tract
and 6 collaborators
DarkCideS 1.0, a global database for bats in karsts and caves
and 35 collaborators
Basic object types in R: vectors and lists
and 1 collaborator
1-Nitropyrene Exposure as Genotoxicity and Oxidative Stress Biomarker
Development of Self-folded Corrugated Structures Using Automatic Origami Technique by Inkjet Printing
and 1 collaborator
Four Bar Motion
Trends and Health Risks of Heavy Metals Present in Sewage Sludge: A Situational Analysis in the Indian Context
and 4 collaborators
Smart electronic nose enabled by an all-feature olfactory algorithm (AFOA)
and 6 collaborators
The five monkeys and critical thinking
and 1 collaborator
The effects of forest edge and nest height on nest predation in a U.K. deciduous forest fragment
Nvidia Hopper GPU and Grace CPU Highlights
and 2 collaborators
A quick introduction to version control with Git and GitHub
and 2 collaborators
Many scientists write code as part of their research. Just as experiments are logged in laboratory notebooks, it is important to document the code you use for analysis. However, a few key problems can arise when iteratively developing code that make it difficult to document and track which code version was used to create each result. First, you often need to experiment with new ideas, such as adding new features to a script or increasing the speed of a slow step, but you do not want to risk breaking the currently working code. One commonly used solution is to make a copy of the script before making new edits. However, this can quickly become a problem because it clutters your filesystem with uninformative filenames, e.g. analysis.sh, analysis_02.sh, analysis_03.sh, etc. It is difficult to remember the differences between the versions of the files, and more importantly which version you used to produce specific results, especially if you return to the code months later. Second, you will likely share your code with multiple lab mates or collaborators and they may have suggestions on how to improve it. If you email the code to multiple people, you will have to manually incorporate all the changes each of them sends.
Fortunately, software engineers have already developed software to manage these issues: version control. A version control system (VCS) allows you to track the iterative changes you make to your code. Thus you can experiment with new ideas but always have the option to revert to a specific past version of the code you used to generate particular results. Furthermore, you can record messages as you save each successive version so that you (or anyone else) reviewing the development history of the code is able to understand the rationale for the given edits. Also, it facilitates collaboration. Using a VCS, your collaborators can make and save changes to the code, and you can automatically incorporate these changes to the main code base. The collaborative aspect is enhanced with the emergence of websites that host version controlled code.
In this quick guide, we introduce you to one VCS, Git (git-scm.com), and one online hosting site, GitHub (github.com), both of which are currently popular among scientists and programmers in general. More importantly, we hope to convince you that although mastering a given VCS takes time, you can already achieve great benefits by getting started using a few simple commands. Furthermore, not only does using a VCS solve many common problems when writing code, it can also improve the scientific process. By tracking your code development with a VCS and hosting it online, you are performing science that is more transparent, reproducible, and open to collaboration \cite{23448176, 24415924}. There is no reason this framework needs to be limited only to code; a VCS is well-suited for tracking any plain-text files: manuscripts, electronic lab notebooks, protocols, etc.
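As a sketch of how those "few simple commands" might look in practice (assuming Git is installed and your user.name/user.email are configured; the filenames and commit messages below are invented for illustration):

```shell
# Create a repository and record a first version of a script.
mkdir analysis && cd analysis
git init                                   # start tracking this directory
echo 'echo "running analysis"' > analysis.sh
git add analysis.sh                        # stage the new file
git commit -m "Add initial analysis script"

# Edit freely; each commit is a labelled, recoverable version,
# replacing the analysis_02.sh-style copies described above.
echo 'echo "faster step"' >> analysis.sh
git commit -am "Speed up the slow step"

git log --oneline                          # review the saved versions
```

Each commit message records the rationale for the edit, so the development history itself documents which version produced which result.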
Design and empirical validation of effectiveness of LANGA, a game-based platform for second-language learning
and 2 collaborators
Scratch Pad / Multiple Dirichlet Convolutions
Consider $$ \phi(s) = c $$ easy, as the moments are constant... arbitrary $$ \phi(s) = c_1 c_2^s $$ again both arbitrary constants... special points $s = 1$, $s = 0$ give $c_1 = \phi(0)$, $c_2 = \phi(1)/\phi(0)$. Consider the first meaningful case $$ \phi(s) = c_1 c_2 ^ s \Gamma(c_3 + c_4 s) $$ $$ \phi(0) = c_1 \Gamma(c_3) $$ $$ \phi(1) = c_1 c_2 \Gamma(c_3 + c_4) $$ $$ \frac{\phi(1)}{\phi(0)} = c_2 \frac{\Gamma(c_3+c_4)}{\Gamma(c_3)} $$ which, if $c_4$ is an integer, can be expanded as a Pochhammer symbol. $$ \phi(\frac{1}{c_4}) = c_1 c_2 ^ {1/c_4} \Gamma(c_3 + 1) = c_1 c_2 ^ {1/c_4} c_3 \Gamma(c_3) $$ $$ \phi(\frac{1}{c_4})\phi^{-1}(0) = c_2 ^ {1/c_4} c_3 $$ Also consider the points where $c_3 + c_4 s \in \{1, 2\}$, at which the gamma term reduces to 1.
$$ \phi\left(\frac{1-c_3}{c_4}\right) = c_1 c_2 ^ \frac{1-c_3}{c_4} $$ $$ \phi\left(\frac{2-c_3}{c_4}\right) = c_1 c_2 ^ \frac{2-c_3}{c_4} $$ noting that $$ \phi\left(\frac{2-c_3}{c_4}\right)\phi^{-2}\left(\frac{1-c_3}{c_4}\right) = c_1^{-1} c_2^{c_3/c_4} $$ $$ \phi\left(\frac{2-c_3}{c_4}\right)\phi^{-1}\left(\frac{1-c_3}{c_4}\right) = c_2^{1/c_4} $$
Which means (great result) $$ \phi(\frac{1}{c_4})\phi^{-1}(0)\phi^{-1}\left(\frac{2-c_3}{c_4}\right)\phi\left(\frac{1-c_3}{c_4}\right) = c_3 $$ if we find enough equations for each parameter, there is a chance of a self consistent/iterative solution? We would ideally want something like c1(c2, c3, c4), etc. unless it doesn’t really matter... $$ \phi^{c_4}\left(\frac{2-c_3}{c_4}\right)\phi^{-c_4}\left(\frac{1-c_3}{c_4}\right) = c_2 $$ $$ \frac{\phi(0)}{\Gamma(c_3)} = c_1 $$ $$ \frac{1}{\log_{c_2}\left(\phi\left(\frac{2-c_3}{c_4}\right)\phi^{-1}\left(\frac{1-c_3}{c_4}\right)\right)} = c_4 $$
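A minimal numerical check of this parameter recovery, with invented parameter values (chosen so every gamma argument is positive), using Python's `math.gamma`:

```python
import math

# Invented parameter values for phi(s) = c1 * c2**s * Gamma(c3 + c4*s).
c1, c2, c3, c4 = 1.7, 2.3, 1.9, 0.8

def phi(s):
    """Moment function phi(s) = c1 * c2**s * Gamma(c3 + c4*s)."""
    return c1 * c2**s * math.gamma(c3 + c4 * s)

# The combination of the four special points recovers c3 ...
lhs = phi(1 / c4) / phi(0) / phi((2 - c3) / c4) * phi((1 - c3) / c4)
print(lhs)  # -> 1.9 (i.e. c3), up to rounding

# ... and the ratio of the two Gamma-free points, raised to c4, recovers c2.
c2_rec = (phi((2 - c3) / c4) / phi((1 - c3) / c4)) ** c4
print(c2_rec)  # -> 2.3 (i.e. c2), up to rounding
```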
However, there are a couple of important factors where we can’t stray into territory that is broken due to analytic continuation...
For some close parameters on at least one example, this works for c1 and c3, the goal is then to get the remaining ones using those.
Consider the log derivative as a secret weapon... measure $$ \theta(s) = \frac{d}{ds} \log \phi(s) = \log(c_2) + c_4 \psi(c_3 + c_4 s) $$
$$ \theta(0) = \log(c_2) + c_4 \psi(c_3) $$
Also we know that $\psi(1) = -\gamma$ and $\psi(2) = 1 - \gamma$... $$ \theta\left(\frac{1-c_3}{c_4}\right) = \log(c_2) - c_4 \gamma $$ $$ \theta\left(\frac{2-c_3}{c_4}\right) = \log(c_2) + c_4 (1-\gamma) $$ then $$ \theta\left(\frac{2-c_3}{c_4}\right) - \theta\left(\frac{1-c_3}{c_4}\right) = c_4 $$ $$ \theta\left(\frac{2-c_3}{c_4}\right) + \theta\left(\frac{1-c_3}{c_4}\right) = 2\log(c_2) + c_4 (1- 2 \gamma) $$
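A quick numerical sanity check of these θ relations (parameter values invented; θ is approximated by a central finite difference of log φ, so no explicit digamma routine is needed):

```python
import math

EULER_GAMMA = 0.5772156649015329   # Euler-Mascheroni constant

# Invented parameter values for phi(s) = c1 * c2**s * Gamma(c3 + c4*s).
c1, c2, c3, c4 = 1.7, 2.3, 1.9, 0.8

def log_phi(s):
    return math.log(c1) + s * math.log(c2) + math.lgamma(c3 + c4 * s)

def theta(s, h=1e-6):
    """theta(s) = d/ds log phi(s), via central finite difference."""
    return (log_phi(s + h) - log_phi(s - h)) / (2 * h)

p1 = (1 - c3) / c4   # here c3 + c4*s = 1, so the psi term is psi(1) = -gamma
p2 = (2 - c3) / c4   # here c3 + c4*s = 2, so the psi term is psi(2) = 1 - gamma

print(theta(p2) - theta(p1))   # -> c4 = 0.8
print(theta(p2) + theta(p1))   # -> 2*log(c2) + c4*(1 - 2*gamma)
```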
Consider the more interesting $$ \phi(s) = c_1 c_2^s \frac{\Gamma(c_3 + c_4 s)}{\Gamma(c_5 + c_6 s)} $$ key points include $$ \phi(0) = c_1 \frac{\Gamma(c_3)}{\Gamma(c_5)} $$ $$ \phi(1) = c_1 c_2 \frac{\Gamma(c_3 + c_4)}{\Gamma(c_5 + c_6)} $$ and potentially important points $$ \phi\left(\frac{1-c_3}{c_4}\right) = c_1 c_2^\frac{1-c_3}{c_4} \frac{1}{\Gamma(c_5 + c_6 \frac{1-c_3}{c_4} )} $$ $$ \phi\left(\frac{2-c_3}{c_4}\right) = c_1 c_2^\frac{2-c_3}{c_4}... $$ $$ \phi\left(\frac{1-c_5}{c_6}\right) = c_1 c_2^\frac{1-c_5}{c_6}... $$ $$ \phi\left(\frac{2-c_5}{c_6}\right) = c_1 c_2^\frac{2-c_5}{c_6}... $$
We also have $$ \theta(s) = \log(c_2) + c_4 \psi( c_3 + c_4 s) - c_6 \psi( c_5 + c_6 s) $$ $$ \theta(0) = \log(c_2) + c_4 \psi( c_3 ) - c_6 \psi( c_5 ) $$ $$ \theta(1) = \log(c_2) + c_4 \psi( c_3 + c_4) - c_6 \psi( c_5 + c_6) $$ and, writing $p_{134} = \frac{1-c_3}{c_4}$ and $p_{234} = \frac{2-c_3}{c_4}$ (the points at which $c_3 + c_4 s$ equals 1 and 2), $$ \theta(p_{134}) = \log(c_2) - \gamma c_4 - c_6 \psi( c_5 + c_6 p_{134}) $$ $$ \theta(p_{234}) = \log(c_2) + (1-\gamma) c_4 - c_6 \psi( c_5 + c_6 p_{234}) $$ etc.
Try $$ \theta(p_{234}) - \theta(p_{134}) = c_4 - c_6 \psi( c_5 + c_6 p_{234}) + c_6 \psi( c_5 + c_6 p_{134}) $$
We might consider the function f3456(s) that sets $$ \frac{\Gamma(c_3 + c_4 f_{3456}(s))}{\Gamma(c_5 + c_6 f_{3456}(s))} = 1 $$
In each case, we really need to think about a value of s that specifically exposes a particular parameter... For Γ(a + bs), to expose the a we can consider a root finder for Γ(x*)−a = 0 and then scale the result as (x* − a)/b... then we just evaluate all the other terms at that point, so in general
$$ \phi(s) = \frac{\Gamma(a_1 + b_1 s)...\Gamma(a_k + b_k s)}{\Gamma(c_1 + d_1 s)...\Gamma(c_l + d_l s)} $$ solve for a bunch of roots numerically for each gamma term. $$ x^*_{\uparrow k} = \frac{root(\Gamma(x) - a_k) - a_k}{b_k} $$ $$ x^*_{\downarrow k} = \frac{root(\Gamma(x) - c_k) - c_k}{d_k} $$ We evaluate the moment function directly at these roots $$ \phi(x^*) $$ To expose the b and d terms, we need to evaluate the log derivative of the moment function θ(s)... this amounts to a weighted sum of digamma functions... we need to figure out how the roots work there $$ c_4 \psi(c_3 + c_4 s) \to -\gamma c_4 $$ alternatively, we could try to equip the root finders with a way to collapse each gamma function into the other parameter...
After this we end up with a set of equations $$ \phi(x^*_{\uparrow j}) = \frac{\Gamma(a_1 + b_1 s)\cdots a_j\cdots \Gamma(a_k + b_k s)}{\Gamma(c_1 + d_1 s)\cdots\Gamma(c_l + d_l s)} $$ $$ \phi(x^*_{\downarrow j}) = \frac{\Gamma(a_1 + b_1 s)\cdots \Gamma(a_k + b_k s)}{\Gamma(c_1 + d_1 s)\cdots c_j \cdots \Gamma(c_l + d_l s)} $$ and we set the update rules to be $$ a_j \to \frac{\phi(x^*_{\uparrow j})}{\Gamma(a_j + b_j x^*_{\uparrow j})}\frac{\Gamma(c_1 + d_1 x^*_{\uparrow j})\cdots\Gamma(c_l + d_l x^*_{\uparrow j})}{\Gamma(a_1 + b_1 x^*_{\uparrow j})\cdots\Gamma(a_k + b_k x^*_{\uparrow j})} $$ and likewise... This means we only need to evaluate the original function, and the single extra divisor.
For $\log D_x$ we seem to have $$ \frac{\log D_x f(x)}{f(x)} + \log(x) = g(x) $$ then $g(x)$ is relatively well behaved, and for powers of $x$ simply $$ x^k \to \psi_0(k+1) $$ but this works for $e^x$ and $\log x$ as $f(x)$, among others...
We seem to have $$ \log D \log D 1 \equiv \lim_{h \to 0} \frac{D^h 1 - 2 D^{-h} 1}{h^2} $$
Consider multiple Dirichlet convolutions.
Example $$ \mu^3 = \sum_{abc=n}\mu(a)\mu(b)\mu(c) = A007428 \to \frac{1}{\zeta^3(s)} $$ $$ \omega \mu^2 = \sum_{abc=n}\omega(a)\mu(b)\mu(c) = A143519 = \sum_{d|n} \chi_p(d)\mu\left(\frac{n}{d}\right) = \sum_{p|n} \mu\left(\frac{n}{p}\right) $$ $$ \omega^2\mu = \sum_{abc=n}\omega(a)\omega(b)\mu(c) = A345354 = \sum_{p|n} \omega\left( \frac{n}{p} \right) $$
In shorthand $$ \Omega \omega \mu = A307409 = (\Omega(n)-1)\omega(n) = \sum_{p|n} \Omega\left( \frac{n}{p} \right)\to ... $$
So including μω in a triple has the effect of summing over prime divisors.
$$ \omega^2 \Omega = ??? $$
$$ \omega^3 = apparently not A200221! $$
Then for four terms apparently $$ \mu^2 \omega^2 = A230595? = \chi_p * \chi_p \to \zeta_p(s)^2 $$
Of course we could also have more varied and complicated expressions such as $$ \sum_{abc=n}\omega(a)\omega(b)\mu(c)\omega(c) $$
$$ |\mu^2 \Omega| = A344478 ? \to ??? $$
$$ \mu^2 \lambda = A326415 = \sum_{d|n} \mu_2(d) \lambda(n/d) = \sum_{d|n} \mu_3(d)\chi_\square(n/d)\to \frac{\zeta(2s)}{\zeta(s)^3} $$ where μ2, μ3 are iterative applications of μ to ϵ.
Which is the 'Moebius transform applied twice' to λ.
$$ \mu^2 x = A007431 = \sum_{d|n} \phi(d) \mu(n/d) $$ where x denotes the identity function (just summing over a or b or c directly in the product)... This is the Moebius transform applied twice to the natural numbers (because of the mu-squared).
We can check whether the token μx acts like a divisor sum against φ. It seems that
$$ \mu x \Omega = A095112 = \sum_{k=1}^{n} \Omega(\gcd(n,k)) = \sum_{d|n} \phi(d) \Omega(n/d) $$
So we have, for some number-theoretic function Q, $$ \mu \omega Q \equiv \sum_{d | n} \chi_p(d) Q(n/d) $$ $$ \mu x Q \equiv \sum_{d | n} \phi(d) Q(n/d) $$
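A quick brute-force check of the μxΩ identity above (assuming the sum over k runs from 1 to n), using naive totient and Ω implementations:

```python
from math import gcd

def divisors(n):
    return [d for d in range(1, n + 1) if n % d == 0]

def totient(n):
    """Euler's phi, by direct count (fine for small n)."""
    return sum(1 for k in range(1, n + 1) if gcd(n, k) == 1)

def big_omega(n):
    """Omega(n): number of prime factors counted with multiplicity."""
    count, p = 0, 2
    while n > 1:
        while n % p == 0:
            n //= p
            count += 1
        p += 1
    return count

for n in range(1, 60):
    lhs = sum(big_omega(gcd(n, k)) for k in range(1, n + 1))
    rhs = sum(totient(d) * big_omega(n // d) for d in divisors(n))
    assert lhs == rhs
print("mu*x*Omega identity holds for n = 1..59")
```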
We can conclude that $$ \sum_{abc = n} f(a)g(b)h(c) = \sum_{d|n}\left[\sum_{q|d}f(q)g\left(\frac{d}{q}\right)\right]h\left(\frac{n}{d}\right) $$ which makes complete sense. This can be extended arbitrarily deep $$ \sum_{x_1x_2x_3x_4 = n} f_1(x_1)f_2(x_2)f_3(x_3)f_4(x_4) = \sum_{d_4|n}\left[\sum_{d_3|d_4}\left[\sum_{d_2|d_3}f_1(d_2)f_2\left(\frac{d_3}{d_2}\right)\right]f_3\left(\frac{d_4}{d_3}\right)\right]f_4\left(\frac{n}{d_4}\right) $$
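The rewriting of a triple convolution as nested divisor sums can be verified numerically, e.g. with f = μ, g = ω, h = μ (naive arithmetic-function implementations, for illustration only):

```python
def divisors(n):
    return [d for d in range(1, n + 1) if n % d == 0]

def mobius(n):
    """Moebius mu, by trial division."""
    result, p = 1, 2
    while n > 1:
        if n % p == 0:
            n //= p
            if n % p == 0:
                return 0          # squared prime factor
            result = -result
        p += 1
    return result

def omega(n):
    """omega(n): number of distinct prime factors."""
    count, p = 0, 2
    while n > 1:
        if n % p == 0:
            count += 1
            while n % p == 0:
                n //= p
        p += 1
    return count

def triple(f, g, h, n):
    """Direct sum over ordered triples a*b*c = n."""
    return sum(f(a) * g(b) * h(n // (a * b))
               for a in divisors(n) for b in divisors(n // a))

def nested(f, g, h, n):
    """Nested divisor sums, as in the displayed identity."""
    return sum(sum(f(q) * g(d // q) for q in divisors(d)) * h(n // d)
               for d in divisors(n))

for n in range(1, 50):
    assert triple(mobius, omega, mobius, n) == nested(mobius, omega, mobius, n)
print("triple convolution matches nested divisor sums for n = 1..49")
```

The equality is just associativity of Dirichlet convolution, so any choice of f, g, h works the same way.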
For any unknown sequence s(n), we can then try to 'fit' a depth-n function. We can define a set of sequences and represent each f as a linear combination (ideally with weights 0, 1...), but then we expand out the terms like
$$ \sum_{d|n} \mathbf{a\cdot f(d)} \mathbf{ b \cdot g(n/d)} = $$
———————–
Consider a graph, with nodes and edges... Consider subgraphs as analogies to integer divisors, perhaps a good analogy is chemical compounds.
Consider the existence of a function that takes a molecular graph as input and outputs a number, i.e. a descriptor calculator such as count the number of rings, or number of appearances of certain chemical groups. f(G)=x.
Now consider a notation $$ g(G) = \sum_{S|G} f(S) $$ where a function g of the graph G is the sum of another function f applied to all the possible subgraphs S of G, including perhaps G itself...
What we might be missing is the notion of G/S. If we think of this as G without S, then we can think of G as being S1S2S3S4; however, for the analogy with numbers we need to consider the existence of prime subgraphs for which there is only one unique graph factorisation.
The main problem is that there are many ways to attach fragments to compounds. A number is like a bag of prime factors: only the count of each prime matters, there is only one result once the bag is evaluated, and the order of evaluation does not matter. For a compound, although there might exist a (very large) set of fragments that could somehow be defined to reasonably cover the space of interesting molecules, if we defined each compound as a bag of these 'prime fragments', 1) the order in which they are taken out of the bag and stuck together (concatenated) matters [a combinatoric problem], and 2) where exactly they are joined together matters [a second combinatoric problem].
Issues with SMARTS counts. We would have to define all prime fragments such that when they were joined in any way no new prime fragments could be counted.
——————–
We consider a method of approximating functions by a statistically optimal matrix.
Consider a function on a fixed domain, e.g. [0, 1]. If we randomly sample a vector of points from the domain and sort the vector into a new vector x, then we have a distribution for each element. For small vectors there will be a reasonable level of variation; for larger vectors the variation will narrow. We then apply the function to that vector to get f. We will consider the statistically optimal matrix A such that Ax ≈ f for any sorted input x.
Some questions: is there only one optimum? Does the accuracy increase arbitrarily for an arbitrarily large matrix? Do the eigenvalues and eigenvectors of A have any connection to the true function f(x)? Is there a way of constructing A from the function f?
We can consider the samples x. This looks a little like a stick-breaking process, and is also related to the distribution of the minimum, maximum, or kth smallest element from n samples of a uniform distribution.
We can first consider a diagonal A for simplicity, but we find this cannot describe any function other than a line well, so we generalise to a full matrix to allow overlap information.
We have the order statistics for a uniform distribution as $$ x_k \sim \mathrm{Beta}(k,n+1-k) $$ then simplistically for the 2 × 2 case we have $$ \begin{bmatrix} a_{11} & a_{12} \\ a_{21} & a_{22} \end{bmatrix} \begin{bmatrix} x_1 \\ x_2 \end{bmatrix} = \begin{bmatrix} f(x_1) \\ f(x_2) \end{bmatrix} $$
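A sketch of fitting the statistically optimal A by least squares over many sorted samples (assuming NumPy is available; the target function sin(πx) and the sample counts are invented for illustration):

```python
import numpy as np

rng = np.random.default_rng(0)
n, samples = 2, 5000

def f(x):
    return np.sin(np.pi * x)     # example target function (an assumption)

# Row i of X is one sorted sample vector x; F holds f applied elementwise.
X = np.sort(rng.uniform(0.0, 1.0, size=(samples, n)), axis=1)
F = f(X)

# Least squares: minimise sum_i ||A x_i - f(x_i)||^2, i.e. solve X A^T ~= F.
A = np.linalg.lstsq(X, F, rcond=None)[0].T

# Order-statistic means should match E[x_k] = k/(n+1) = 1/3, 2/3 for n = 2.
print(X.mean(axis=0))
# Mean squared residual of the fitted linear map.
print(np.mean((X @ A.T - F) ** 2))
```

The empirical column means give a direct check against the Beta(k, n+1-k) order-statistic distribution above, and the residual quantifies how far a single matrix can get for a nonlinear f.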
Using models to estimate air quality
and 1 collaborator
Simple Physics with Python: a workbook on introductory Physics with open source software
and 4 collaborators