Separating Style and Content with Bilinear Models

2000 ◽  
Vol 12 (6) ◽  
pp. 1247-1283 ◽  
Author(s):  
Joshua B. Tenenbaum ◽  
William T. Freeman

Perceptual systems routinely separate “content” from “style,” classifying familiar words spoken in an unfamiliar accent, identifying a font or handwriting style across letters, or recognizing a familiar face or object seen under unfamiliar viewing conditions. Yet a general and tractable computational model of this ability to untangle the underlying factors of perceptual observations remains elusive (Hofstadter, 1985). Existing factor models (Mardia, Kent, & Bibby, 1979; Hinton & Zemel, 1994; Ghahramani, 1995; Bell & Sejnowski, 1995; Hinton, Dayan, Frey, & Neal, 1995; Dayan, Hinton, Neal, & Zemel, 1995; Hinton & Ghahramani, 1997) are either insufficiently rich to capture the complex interactions of perceptually meaningful factors such as phoneme and speaker accent or letter and font, or do not allow efficient learning algorithms. We present a general framework for learning to solve two-factor tasks using bilinear models, which provide sufficiently expressive representations of factor interactions but can nonetheless be fit to data using efficient algorithms based on the singular value decomposition and expectation-maximization. We report promising results on three different tasks in three different perceptual domains: spoken vowel classification with a benchmark multi-speaker database, extrapolation of fonts to unseen letters, and translation of faces to novel illuminants.

2018 ◽  
Vol 52 (3) ◽  
pp. 337-345
Author(s):  
Vencel Somai ◽  
David Legrady ◽  
Gabor Tolnai

Abstract Background In emission tomography maximum likelihood expectation maximization reconstruction technique has replaced the analytical approaches in several applications. The most important drawback of this iterative method is its linear rate of convergence and the corresponding computational burden. Therefore, simplifications are usually required in the Monte Carlo simulation of the back projection step. In order to overcome these problems, a reconstruction code has been developed with graphical processing unit based Monte Carlo engine which enabled full physical modelling in the back projection. Materials and methods Code performance was evaluated with simulations on two geometries. One is a sophisticated scanner geometry which consists of a dodecagon with inscribed circle radius of 8.7 cm, packed on each side with an array of 39 × 81 LYSO detector pixels of 1.17 mm sided squares, similar to a Mediso nanoScan PET/CT scanner. The other, simplified geometry contains a 38,4mm long interval as a voxel space, detector pixels are assigned in two parallel sections each containing 81 crystals of a size 1.17×1.17 mm. Results We have demonstrated that full Monte Carlo modelling in the back projection step leads to material dependent inhomogeneities in the reconstructed image. The reasons behind this apparently anomalous behaviour was analysed in the simplified system by means of singular value decomposition and explained by different speed of convergence. Conclusions To still take advantage of the higher noise stability of the full physical modelling, a new filtering technique is proposed for convergence acceleration. Some theoretical considerations for the practical implementation and for further development are also presented.


2005 ◽  
Vol 62 (2) ◽  
pp. 310-330 ◽  
Author(s):  
A. Navarra ◽  
J. Tribbia

Abstract A new method is presented to detect the portion of variability connected between two climatic fields. The method is a realization of the Procrustes problem, and it is a generalization of methods for analysis of variance such as the singular value decomposition (SVD) or canonical correlation analysis (CCA). The Procrustes formulation offers a general framework to link together variance analysis methods, and regression methods, including as special cases SVD and CCA. Using this approach two fields can be divided into a subspace where variations of one field are connected to variations of the other field, the coupled manifold, and a subspace where variations are independent, the free manifold. The unified approach can be applied to prescribed SST experiments, in which case it recovers consistent results with other methods designed to identify the forced portion of variance, but it can now be extended also to the coupled case or to observations. Some examples from prescribed SST simulation experiments and observations are discussed.


2017 ◽  
Author(s):  
Ammar Ismael Kadhim ◽  
Yu-N Cheah ◽  
Inaam Abbas Hieder ◽  
Rawaa Ahmed Ali

2016 ◽  
Author(s):  
David Barner

Perceptual representations – e.g., of objects or approximate magnitudes –are often invoked as building blocks that children combine with linguisticsymbols when they acquire the positive integers. Systems of numericalperception are either assumed to contain the logical foundations ofarithmetic innately, or to supply the basis for their induction. Here Ipropose an alternative to this general framework, and argue that theintegers are not learned from perceptual systems, but instead arise toexplain perception as part of language acquisition. Drawing oncross-linguistic data and developmental data, I show that small numbers(1-4) and large numbers (~5+) arise both historically and in individualchildren via entirely distinct mechanisms, constituting independentlearning problems, neither of which begins with perceptual building blocks.Specifically, I propose that children begin by learning small numbers(i.e., *one, two, three*) using the same logical resources that supportother linguistic markers of number (e.g., singular, plural). Several yearslater, children discover the logic of counting by inferring the logicalrelations between larger number words from their roles in blind countingprocedures, and only incidentally associate number words with perception ofapproximate magnitudes, in an *ad hoc* and highly malleable fashion.Counting provides a form of explanation for perception but is not causallyderived from perceptual systems.


2020 ◽  
Vol 13 (6) ◽  
pp. 1-10
Author(s):  
ZHOU Wen-zhou ◽  
◽  
FAN Chen ◽  
HU Xiao-ping ◽  
HE Xiao-feng ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document