<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>Memming</title>
	<atom:link href="http://memming.wordpress.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://memming.wordpress.com</link>
	<description>the inconsistent</description>
	<lastBuildDate>Wed, 04 Jan 2012 22:47:55 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='memming.wordpress.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://s2.wp.com/i/buttonw-com.png</url>
		<title>Memming</title>
		<link>http://memming.wordpress.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://memming.wordpress.com/osd.xml" title="Memming" />
	<atom:link rel='hub' href='http://memming.wordpress.com/?pushpress=hub'/>
		<item>
		<title>NIPS 2011</title>
		<link>http://memming.wordpress.com/2011/12/21/nips-2011/</link>
		<comments>http://memming.wordpress.com/2011/12/21/nips-2011/#comments</comments>
		<pubDate>Wed, 21 Dec 2011 13:40:26 +0000</pubDate>
		<dc:creator>memming</dc:creator>
				<category><![CDATA[Conference]]></category>
		<category><![CDATA[machine learning]]></category>
		<category><![CDATA[NIPS]]></category>
		<category><![CDATA[non-parameteric Bayes]]></category>

		<guid isPermaLink="false">http://memming.wordpress.com/?p=609</guid>
		<description><![CDATA[This was my second NIPS (see last year&#8217;s NIPS summary). It had a lower acceptance rate of 22% (I served as a reviewer last year and this year). I felt like there were more computational neuroscience related posters than last year (perhaps due to the location in Europe). Non-parametric Bayes, reinforcement learning (MDP), and sparse [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=memming.wordpress.com&amp;blog=6664373&amp;post=609&amp;subd=memming&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>This was my second NIPS (see <a title="NIPS 2010 main conference" href="http://memming.wordpress.com/2010/12/14/nips-2010-main-conference/">last year&#8217;s NIPS summary</a>). It had a lower acceptance rate of 22% (I served as a reviewer last year and this year). I felt like there were more computational neuroscience related posters than last year (perhaps due to the location in Europe). Non-parametric Bayes, reinforcement learning (MDP), and sparse learning were still big while kernel related posters were less. This post is a summary of my experience, and any error is due to myself (please let me know if you find any).</p>
<h3>Monday</h3>
<p><strong>Dynamical segmentation of single trials from population neural data</strong><br />
<a href="http://www.gatsby.ucl.ac.uk/members.html">Biljana Petreska</a>, M. Sahani, B. Yu, J. Cunningham, S. Ryu, K. Shenoy, Gopal Santhanam</p>
<p>A randomly switching piecewise-linear dynamical system model is constructed via discrete latent states. Given a state, the dynamics of spiking neurons are assumed to be linear. This model is fit to 105 simultaneously recorded neurons (Utah array) during a motor task. Number of states were chosen heuristically. This is an unsupervised method that automatically captures the structure of the dynamics. The results suggest that neurons tend to be in a linear dynamical state both when waiting for the go-cue, and during early movement, and goes through nonlinear dynamical transitions in between.</p>
<p><strong>Inferring spike-timing-dependent plasticity from spike train data</strong><br />
Ian H. Stevenson, Konrad P. Kording</p>
<p>Different synapses have different form of STDP, and while spike train data are abundant, in vivo whole cell recordings are very difficult. Hence, learning the synaptic plasticity rule from just spike train observation is of great importance. This is one of my long-term goals as well. They fit a unidirectionally coupled GLM model with a binned weight modulation function as a function of timing to previous presynaptic spike. The results are promising for simulated models. I&#8217;d love to see it applied to a well controlled real data.</p>
<p><strong>Active dendrites: adaptation to spike-based communication</strong><br />
Balázs B Ujfalussy, Máté Lengyel</p>
<p>In the presence of correlated presynaptic population activity, to compute a function of presynaptic voltage online from spikes, the neuron has to be nonlinear. In particular, this paper links it to the nonlinear summation property of the dendrite. In previous work by <a title="Synapses with short-term plasticity are optimal estimators of presynaptic membrane potentials" href="http://www.nature.com/neuro/journal/v13/n10/full/nn.2640.html">Pfister, J., Dayan, P., Lengyel, M. (2010)</a>, they explained the role of short-term plasticity (dynamical synapse model) as optimal predictor for presynaptic membrane potential for a single neuron. This work expands it to the population case.</p>
<p><strong>From stochastic nonlinear integrate-and-fire to generalized linear models</strong><br />
Skander Mensi, Richard Naud, Wulfram Gersnter</p>
<p>This poster shows that given a stochastic (adaptive-exponential) leaky-integrate-and-fire-neuron model, it is possible to construct a nearly equivalent GLM model (as a form of spike response model (SRM) with escape noise). Sub-threshold dynamics is linearized to provide the linear filter (corresponding to impulse response) and the reset/refractoriness part of the history filter, while the spike-adaptation is captured as a slower time scale component of the history filter. Then the link function can be estimated through <em>empirical observation</em> that <img src='http://s0.wp.com/latex.php?latex=%5Clog%28-%5Clog%28p%28V%29%29%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;log(-&#92;log(p(V)))' title='&#92;log(-&#92;log(p(V)))' class='latex' /> is close to being linear. (I was totally thrown off by the notation <img src='http://s0.wp.com/latex.php?latex=p%28V%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p(V)' title='p(V)' class='latex' /> which was the probability of spiking given a membrane potential, not the marginal distribution of voltage distribution of the model.)</p>
<p><strong>Gaussian process modulated renewal processes</strong><br />
Vinayak Rao, Yee Whye Teh</p>
<p>This is an extension of <a href="http://www.cs.toronto.edu/~rpa/adams-murray-mackay-2009b.shtml"> R. P. Adams, I. Murray and D. J.C. MacKay&#8217;s work </a>which was on Poisson intensity estimation to hazard rate modulated renewal process. Basic ideas are similar; use a sigmoidal link function, and use point process thinning like procedure to exactly sample.</p>
<h3>Tuesday</h3>
<p><strong>Learning in Hilbert vs. Banach spaces: A measure embedding viewpoint<br />
</strong><a href="http://www.gatsby.ucl.ac.uk/~bharath/">Bharath K. Sriperumbudur</a>, Kenji Fukumizu, Gert R. G. Lanckriet</p>
<p>Kernel embedding of probability distribution and induced divergence is an emerging direction of kernel methods. The divergence is related to Bayes risk of Parzen window classifier in particular, and this paper extends the results to Banach spaces. For a Banach space with a norm that is uniformly Fretchet differentiable, and uniformly convex, there is a <a href="http://en.wikipedia.org/wiki/Semi-inner-product">semi-inner product</a> inducing an reproducing kernel Banach space (RKBS) which has analogous properties to RKHS. They showed that kernel embedding is injective when the kernel is a Fourier transform of a signed measure (c.f. <a href="http://en.wikipedia.org/wiki/Bochner%27s_theorem">Bochner&#8217;s theorem</a> requires a positive measure for positive definiteness). The resulting divergence is not computable, unless the semi-metric is of special form, and the convergence rate turns out to be at best same as the RKHS case.</p>
<p><strong>Modelling genetic variations with fragmentation-coagulation processes</strong><br />
Yee Whye Teh, Charles Blundell, Lloyd T. Elliott</p>
<p>Similar to Chinese restaurant process (CRP) for clustering, a temporal evolution of clusters by fragmentation (breaking a table into two tables) and coagulation (merging two tables) can be described as a Fragmentation-Coagulation Process (FCP). They show that FCP is exchangeable, reversible, and has asymptotic distribution of CRP.</p>
<p><strong>Priors over recurrent continuous time processes</strong> [<a href="http://www.stat.ubc.ca/~bouchard/GEP/">code</a>]<br />
Ardavan Saeedi, Alexandre Bouchard-Cŏté</p>
<p>This paper received the best student paper award this year, and Ardavan is only a masters student! The problem he is interested in is a discrete latent state dependent continuous time series with partial observation process. For example, a recurrent disease with coarsely quantified states. He introduces the Gamma-exponential process, where an infinite Markovian transition rate matrix prior is given, extends to hierarchical case, and shows how to do inference.</p>
<p><strong>Kernel Beta process</strong><br />
Lu Ren, Yingjian Wang, David Dunson, Lawrence Carin (none of the authors made it to the conference)</p>
<p>Beta process is a distribution over discrete random measures where each &#8220;stick&#8221; is in <img src='http://s0.wp.com/latex.php?latex=%5B0%2C+1%5D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='[0, 1]' title='[0, 1]' class='latex' />, but does not sum to 1 as Dirichlet process (DP) does. In this paper, they smooth the sticks in relation to covariates through a kernel, such that their heights are correlated. Kernel here does not have to be positive definte, but only bounded positive functions (like pdf&#8217;s). <del>I&#8217;m curious if a similar approach can be taken for DP.</del> This was originally done in similar fashion for DP by <a href="http://dx.crossref.org/10.1093%2Fbiomet%2Fasn012">Dunson and Park (2008)</a> (&#8216;kernel stick breaking process&#8217;).</p>
<p><strong>Sparse estimation with structured dictionaries</strong><br />
David Wipf</p>
<p>Given an ill-posed problem <img src='http://s0.wp.com/latex.php?latex=Y+%3D+%5CPhi+X+%2B+%5Cepsilon&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='Y = &#92;Phi X + &#92;epsilon' title='Y = &#92;Phi X + &#92;epsilon' class='latex' />, where the dictionary <img src='http://s0.wp.com/latex.php?latex=%5CPhi&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;Phi' title='&#92;Phi' class='latex' />, and observation <img src='http://s0.wp.com/latex.php?latex=Y&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='Y' title='Y' class='latex' /> is known, under sparsity assumption this can be solved with <img src='http://s0.wp.com/latex.php?latex=l_1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='l_1' title='l_1' class='latex' /> regularization, when <img src='http://s0.wp.com/latex.php?latex=%5CPhi&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;Phi' title='&#92;Phi' class='latex' /> is incoherent (roughly independent columns). However, when the dictionary is more structured, it can cause problems. This paper alleviates this problem by transforming the sparse variables which effectively re-normalizes them. It turns out the solution is similar to iteratively reweighted <img src='http://s0.wp.com/latex.php?latex=l_1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='l_1' title='l_1' class='latex' /> with a different penality.</p>
<p><strong>Sequence learning with hidden units in spiking neural networks</strong><br />
Johanni Brea, Walter Sen, Jean-Pascal Pfister</p>
<p>Given a point process, the problem is to train a spiking neural network composed of GLM units (including hidden units) that would generate the training patterns. Minimization of KL-divergence between the given point process, and the one parameterized by GLM is done by online gradient descent. The gradient requires marginalization over the spikes of the hidden units: <img src='http://s0.wp.com/latex.php?latex=%5Cfrac%7B-%5Cpartial+D_%7BKL%7D%28p%5E%5Cast+%7C%7C+p_%5Ctheta%29%7D%7B%5Cpartial+%5Ctheta%7D+%3D+E%5Cleft%5B+%5Cfrac%7B%5Clog+p%28v%2Ch%3B%5Ctheta%29%7D%7B%5Cpartial+%5Ctheta%7D+%5Cright%5D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;frac{-&#92;partial D_{KL}(p^&#92;ast || p_&#92;theta)}{&#92;partial &#92;theta} = E&#92;left[ &#92;frac{&#92;log p(v,h;&#92;theta)}{&#92;partial &#92;theta} &#92;right]' title='&#92;frac{-&#92;partial D_{KL}(p^&#92;ast || p_&#92;theta)}{&#92;partial &#92;theta} = E&#92;left[ &#92;frac{&#92;log p(v,h;&#92;theta)}{&#92;partial &#92;theta} &#92;right]' class='latex' />, so they developed an importance sampling scheme where the samples from hidden units are obtained given the training spikes. The resulting training rule is Hebbian, and analogous to STDP. The results are shown when the given distribution is a delta, that is, when the network has to produce exactly one pattern, and that pattern only.</p>
<h3>Wednesday</h3>
<p>I presented <a title="Bayesian Spike Triggered Covariance Analysis" href="http://memming.wordpress.com/2011/12/08/bayesian-spike-triggered-covariance-analysis/">Bayesian Spike Triggered Covariance analysis</a> as a poster:</p>
<p style="text-align:center;"><a href="http://memming.files.wordpress.com/2011/12/nips2011_poster_w88_bstc.jpg"><img class="aligncenter  wp-image-614" title="NIPS2011_poster_W88_BSTC" src="http://memming.files.wordpress.com/2011/12/nips2011_poster_w88_bstc.jpg?w=420&#038;h=314" alt="" width="420" height="314" /></a></p>
<h3 style="text-align:left;">Thursday</h3>
<p><strong>Empirical models of spiking in neural populations</strong><br />
Jakob H. Macke, <a href="http://www.gatsby.ucl.ac.uk/members.html">Lars Büsing</a>, John P. Cunningham, Byron M. Yu, Krishna V. Shenoy, Maneesh Sahani</p>
<p>A comparison study between coupled GLM model and latent variable model (Poisson linear dynamical system) to fit the motor cortex observations (preparation phase only). While GLM explicitly allows only coupled input between the output of the population spiking history, the latent variable model allowed a low dimensional hidden common input source with linear dynamics. They show that the latent variable model fits better and could reconstruct the cross-correlations while the GLM couldn&#8217;t. There were quite a bit of discussions on the floor after the oral presentation. The difference in performance was probably due to (1) relatively large bin size (10 ms), (2) neurons were recorded by Utah array which means low probability of direct connectivity. The coupled GLM was successfully applied to retina where the coupling is local, and the sampling of the neurons were very high with 0.1 ms bin size. It would be interesting to see further developments of latent variable models and GLMs in modeling such motor system data.</p>
<h3>Workshops</h3>
<p><strong>Hierarchical algorithms for χ-armed bandits<br />
</strong><a href="http://researchers.lille.inria.fr/~munos/">Rémi Munos</a><strong><br />
</strong></p>
<p>This was a non-Bayesian invited talk for the <a href="http://www.cs.ubc.ca/~hutter/nips2011workshop/index.html">Bayesian optimization, experimental design and bandits workshop.</a> He talked about his paper for the main conference &#8220;Optimistic Optimization of a Deterministic Function without the Knowledge of its Smoothness&#8221;. In this case, smoothness assumption comes from <img src='http://s0.wp.com/latex.php?latex=f%28x%5E%2A%29+-+f%28x%29+%5Cleq+l%28x%5E%2A%2C+x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(x^*) - f(x) &#92;leq l(x^*, x)' title='f(x^*) - f(x) &#92;leq l(x^*, x)' class='latex' />, where <img src='http://s0.wp.com/latex.php?latex=l&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='l' title='l' class='latex' /> is a semi-metric, and hence the function is bounded from below around the global maximum by the semi-metric. Then using a hierarchical partitioning of the input space that respects the semi-metric, one can get a bound of the function. (This assumption is  not weaker nor stronger than <a href="http://en.wikipedia.org/wiki/Lipschitz_continuity">Lipschitz continuity</a>, since the absolute value is missing and it is only from the maximum.) When the knowledge of semi-metric is perfect, the convergence rate of the simple regret (best function value) can be exponential (depending on the semi-metric; multiple semi-metrics can give the bound but the convergence rate can differ). When the semi-metric is unknown, and one overestimates the exponent, for example, global convergence is not guaranteed.</p>
<p><a href="http://www.cs.ubc.ca/~hutter/nips2011workshop/papers_and_posters/CameraReady-Dynamic%20Batch%20Bayesian%20Optimization.pdf"><strong>Dynamic Batch Bayesian Optimization</strong></a><br />
<a href="http://web.engr.oregonstate.edu/~azimi/">Javad Azimi</a>, <a href="http://www.ali-jalali.com/">Ali Jalali</a>, Xiaoli Fern</p>
<p>When parallel experiments are possible, experimental design with batch sampling can improve the efficiency, but sequential design often performs better than batch design. Under the assumption that the maximum of the function has a known bound, and using the GP predictive covariance, they choose a set of points that are loosely independent, and could improve the criterion.</p>
<p><strong>Future information minimization as PAC Bayes regularization in Reinforcement Learning</strong><br />
Naftali Tishby</p>
<p>This was the last invited talk for the <a href="http://people.kyb.tuebingen.mpg.de/seldin/fimos.html">New frontiers in model order selection workshop</a>. Tishby talked about reinforcement learning in a <a href="http://en.wikipedia.org/wiki/Partially_observable_Markov_decision_process">POMDP</a> setup, but I couldn&#8217;t fully follow (in fact it went over my head mostly). In a perception-action cycle, the Bellman equation describes the world evolution and associated reward, and he describes a counter part for the agent (mental state?) using an associated Bellman equation with information-to-go (mutual information with respect to a goal).  Then he describes reinforcement learning as a coding problem (relating to <a href="http://en.wikipedia.org/wiki/Kraft%27s_inequality">Kraft&#8217;s inequality</a>, which says subtree of an optimal coding tree is an optimal coding tree). At some point, he reaches PAC-Bayesian bound, and claims that reinforcement learning self-regularizes.</p>
<p><strong>Between the philosophy of science and machine learning</strong><br />
<a href="http://en.wikipedia.org/wiki/David_Corfield">David Corfield</a> [<a href="http://www.kent.ac.uk/secl/philosophy/staff/corfield.html">U of Kent</a>]</p>
<p>This was the first invited talk for the <a href="http://www.dsi.unive.it/PhiMaLe2011/">Philosophy and machine learning workshop</a>. He talked about a broad range of philosophers (of science) and a couple of examples of interaction between ML. The first example was <a href="http://en.wikipedia.org/wiki/Karl_popper">Karl Popper</a>&#8216;s idea of complexity of theory in terms of falsifiable dimensions and its similarity to <a href="http://en.wikipedia.org/wiki/VC_dimension">VC dimension</a> (see <a href="http://dx.doi.org/10.1007/s10838-009-9091-3">their paper in 2009</a> for details). The second example was Judea Pearl&#8217;s use of <a href="http://plato.stanford.edu/entries/causation-counterfactual">counterfactual</a> (by <a href="http://en.wikipedia.org/wiki/David_Kellogg_Lewis">David Lewis</a>), and its impact on philosophy of science. He talked about what kinds of sciences can be benefited from ML, certainly the ones with lots of data. He also went through many philosopher&#8217;s ideas including: Popper, Carnap, Kuhn and Lakatos, <a href="http://en.wikipedia.org/wiki/Paul_Feyerabend">Feyerabend</a>. It is certainly a very fascinating area, but my impression was that we don&#8217;t have much to talk about yet.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/memming.wordpress.com/609/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/memming.wordpress.com/609/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/memming.wordpress.com/609/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/memming.wordpress.com/609/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/memming.wordpress.com/609/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/memming.wordpress.com/609/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/memming.wordpress.com/609/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/memming.wordpress.com/609/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/memming.wordpress.com/609/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/memming.wordpress.com/609/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/memming.wordpress.com/609/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/memming.wordpress.com/609/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/memming.wordpress.com/609/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/memming.wordpress.com/609/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=memming.wordpress.com&amp;blog=6664373&amp;post=609&amp;subd=memming&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://memming.wordpress.com/2011/12/21/nips-2011/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/aab80aeaf4754dee5d4a758db50e7213?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">memming</media:title>
		</media:content>

		<media:content url="http://memming.files.wordpress.com/2011/12/nips2011_poster_w88_bstc.jpg" medium="image">
			<media:title type="html">NIPS2011_poster_W88_BSTC</media:title>
		</media:content>
	</item>
		<item>
		<title>Bayesian Spike Triggered Covariance Analysis</title>
		<link>http://memming.wordpress.com/2011/12/08/bayesian-spike-triggered-covariance-analysis/</link>
		<comments>http://memming.wordpress.com/2011/12/08/bayesian-spike-triggered-covariance-analysis/#comments</comments>
		<pubDate>Thu, 08 Dec 2011 16:16:16 +0000</pubDate>
		<dc:creator>memming</dc:creator>
				<category><![CDATA[Neuroscience]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[BSTC]]></category>
		<category><![CDATA[LNP]]></category>
		<category><![CDATA[NIPS]]></category>
		<category><![CDATA[STA]]></category>
		<category><![CDATA[STC]]></category>

		<guid isPermaLink="false">http://memming.wordpress.com/?p=593</guid>
		<description><![CDATA[A widely used tool in neural characterization, where one is interested in the stimulus (or behavior) features that a neuron is sensitive to, is spike triggered averaging  (STA) or otherwise known as reverse correlation analysis [Dayan &#38; Abbott]. At the occurrence of each spike, one averages the stimulus in a window time locked relative to [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=memming.wordpress.com&amp;blog=6664373&amp;post=593&amp;subd=memming&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>A widely used tool in neural characterization, where one is interested in the stimulus (or behavior) features that a neuron is sensitive to, is <strong>spike triggered averaging  (STA)</strong> or otherwise known as <strong>reverse correlation</strong> analysis [Dayan &amp; Abbott]. At the occurrence of each spike, one averages the stimulus in a window time locked relative to the spike timing, that potentially causes the spike (or behavior that is caused by the spike) to obtain STA.</p>
<p>It essentially estimates the first order Volterra expansion of the neural response function, that is, approximating a neuron as a linear system. Although neuron is not really a linear system, STA works well in practice. Moreover, it is a consistent estimator for a <em>linear-nonlinear Poisson</em> (LNP) model if the stimulus is white Gaussian noise [Bussgang 1952 in Dayan &amp; Abbott]. In [Paninski 2003] this condition is extended to an arbitrary radially symmetric stimulus that induces non-zero mean response.</p>
<p>When the neuron&#8217;s features space is in low-dimension, but not 1-dimension, then STA is not sufficient, since it recovers only a 1-dimensional subspace. <strong>Spike triggered covariance (STC)</strong> is an extension of STA that can consistently estimate filters of a multi-dimensional LNP model [Paninski 2003]. Let us denote the zero-mean stimulus distribution as <img src='http://s0.wp.com/latex.php?latex=p%28x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p(x)' title='p(x)' class='latex' />, and the spike triggered distribution as <img src='http://s0.wp.com/latex.php?latex=q%28x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='q(x)' title='q(x)' class='latex' />. Then, STA is the mean of <img src='http://s0.wp.com/latex.php?latex=%5Chat%7Bq%7D%28x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;hat{q}(x)' title='&#92;hat{q}(x)' class='latex' /> (empirical estimate of <img src='http://s0.wp.com/latex.php?latex=q%28x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='q(x)' title='q(x)' class='latex' />), and STC is the eigen-vectors of the covariance matrix of <img src='http://s0.wp.com/latex.php?latex=%5Chat%7Bq%7D%28x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;hat{q}(x)' title='&#92;hat{q}(x)' class='latex' />. STC is only a consistent estimator when the stimulus distribution is Gaussian [for details, see Paninski 2003].</p>
<p>STA/STC are moment based estimators, and does not have a probabilistic model. Analogous to PPCA (probabilistic principal component analysis) provided a generative model for PCA, allowing Bayesian extensions of PCA, we formulate the <strong>STA/STC problem as a maximum likelihood estimate of a generative model</strong>. Inspired by iSTAC [Pillow &amp; Simoncelli 2006], we extend the LNP model (figure) with exponentiated quadratic nonlinearity. This allows us to put priors on the features, and develop Bayesian estimators. We further extend it to a general family of models, that allows consistent estimation using arbitrary stimulus distribution and flexible class of nonlinearities. <a href="http://books.nips.cc/nips24.html">This result</a> will be presented at <a href="http://nips.cc/Conferences/2011/"><strong>Neural Information Processing Systems (NIPS) 2011</strong></a>. If you are coming to NIPS, it&#8217;s poster W88!</p>
<p><a href="http://memming.files.wordpress.com/2011/12/lnp_schematic.png"><img class="aligncenter size-full wp-image-606" title="LNP schematic" src="http://memming.files.wordpress.com/2011/12/lnp_schematic.png?w=600&#038;h=260" alt="Linear Nonlinear Poisson model with quadratic nonlinearity" width="600" height="260" /></a></p>
<ul>
<li>Dayan &amp; Abbott. <a href="http://www.amazon.com/gp/product/0262541858/ref=as_li_ss_tl?ie=UTF8&amp;tag=memmisprefe-20&amp;linkCode=as2&amp;camp=1789&amp;creative=390957&amp;creativeASIN=0262541858">Theoretical Neuroscience: Computational and Mathematical Modeling of Neural Systems</a>. MIT Press 2001</li>
<li>Paninski. <a href="http://www.ingentaconnect.com/content/tandf/network/2003/00000014/00000003/art00005">Convergence properties of three spike-triggered analysis techniques</a>. Network: Computation in Neural Systems. v 14 p 437-464. 2003</li>
<li>Pillow &amp; Simoncelli. <a href="http://www.journalofvision.org/content/6/4/9">Dimensionality reduction in neural models: An information-theoretic generalization of spike-triggered average and covariance analysis.</a> Journal of Vision. v 6 p 414-428. 2006</li>
</ul>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/memming.wordpress.com/593/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/memming.wordpress.com/593/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/memming.wordpress.com/593/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/memming.wordpress.com/593/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/memming.wordpress.com/593/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/memming.wordpress.com/593/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/memming.wordpress.com/593/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/memming.wordpress.com/593/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/memming.wordpress.com/593/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/memming.wordpress.com/593/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/memming.wordpress.com/593/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/memming.wordpress.com/593/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/memming.wordpress.com/593/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/memming.wordpress.com/593/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=memming.wordpress.com&amp;blog=6664373&amp;post=593&amp;subd=memming&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://memming.wordpress.com/2011/12/08/bayesian-spike-triggered-covariance-analysis/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/aab80aeaf4754dee5d4a758db50e7213?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">memming</media:title>
		</media:content>

		<media:content url="http://memming.files.wordpress.com/2011/12/lnp_schematic.png" medium="image">
			<media:title type="html">LNP schematic</media:title>
		</media:content>
	</item>
		<item>
		<title>Poisson like noise in the brain</title>
		<link>http://memming.wordpress.com/2011/07/14/poisson-like-noise-in-the-brain/</link>
		<comments>http://memming.wordpress.com/2011/07/14/poisson-like-noise-in-the-brain/#comments</comments>
		<pubDate>Thu, 14 Jul 2011 19:20:46 +0000</pubDate>
		<dc:creator>memming</dc:creator>
				<category><![CDATA[Neuroscience]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[neural code]]></category>
		<category><![CDATA[neural variability]]></category>
		<category><![CDATA[Poisson process]]></category>
		<category><![CDATA[PPC]]></category>
		<category><![CDATA[STDP]]></category>

		<guid isPermaLink="false">http://memming.wordpress.com/?p=553</guid>
		<description><![CDATA[Spiking activity of cortical neurons seems to be highly irregular and not very repeatable over trials, hence it is often approximated as a Poisson process. But, why does the brain seem to be so noisy? There are a few theories, and I&#8217;d like to discuss a few of them here. Hypothesis 1. Neurons are rate [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=memming.wordpress.com&amp;blog=6664373&amp;post=553&amp;subd=memming&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Spiking activity of cortical neurons seems to be highly irregular and not very repeatable over trials, hence it is often approximated as a Poisson process. But, why does the brain seem to be so noisy? There are a few theories, and I&#8217;d like to discuss a few of them here.</p>
<h3>Hypothesis 1. Neurons are rate coding and the variability is noise</h3>
<p>Because of the nature of biological processes, temporal fluctuations of channels, probabilistic behavior or synaptic transmission and so on act as noise sources to the spike train produced by a neuron (e.g. [Carandini]). Therefore, the neuron is a probabilistic unit of processing. The real information can only be retrieved by marginalizing over multiple trials or the population by reducing the noise (or averaging over time). Linear-nonlinear-Poisson (LNP) modeling of neural activity is popular where the instantaneous rate is the signal, and the observation is a result of a Poisson spike generation [Paninski].</p>
<h3>Hypothesis 2. Deterministic transformation of random input</h3>
<p>On the opposite side of the spectrum, there are theories where each neuron is mostly a deterministic processing unit, and the Poisson like response originates from the variability of the natural statistics of the world combined with the uncontrollable internal state. For example, if a white Gaussian noise is presented to a leaky integrate-and-fire neuron, the resulting firing activity is a renewal process. The resulting inter-spike interval distribution is usually not analytically tractable, but when the firing rate is low (or the inter-spike interval is longer than the membrane time constant), it can be well approximated with a Poisson process [Stevens] (sometimes referred to as deterministic Poisson process).</p>
<h3>Hypothesis 3. Maximizing information transfer</h3>
<p>If one views sensory system as maximizing the mutual information of the input and output under certain constraints, the spike trains generated by a stochastic input should be close to a Poisson process. This is because Poisson process is the maximum entropy process among homogeneous point processes with the same rate [Rényi]. There are a few STDP like learning rules derived from such principles [Toyoizumi]. To derive these learning theories neurons themselves are not required to be deterministic, rather it is easier to deal with probabilistic models. But in principle, the neurons are trying to make the spike train as homogeneous Poisson like as possible.</p>
<h3>Hypothesis 4. Brain uses variability to represent uncertainty of belief</h3>
<p>This is the main idea of probabilistic population code (PPC) [Ma]. The source of variability is the uncertainty of the world or the brain&#8217;s model (model mismatch) and the population encoding model P(r|s) summarizes it. The posterior belief (represented as Bayesian probability) about the stimulus given the population spike train responses P(s|r) is proportional to P(r|s)P(s) by Bayes rule. If P(r|s) is composed of Poisson neurons with rate proportional to a corresponding population of tuning curves, then the stimulus uncertainty (assuming normal distribution) is represented by the gain of population firing rate. Simple Bayesian inference from multiple sources can be simply done by adding corresponding firing rates. Note that the coding scheme is rate code, but utilizes the population gain and variability to do Bayesian inference.</p>
<h3>Hypothesis 5. Sampling hypothesis</h3>
<p>Related to PPC, an alternative way of representing a posterior is to sample from it — in this case as a population firing pattern. This is known as the sampling hypothesis [Fiser]. The variability of the neurons is mainly because of the broadness of the posterior. If the brain had no uncertainty about the stimulus, then the posterior will be very narrow, and the firing pattern will have no variability.</p>
<p>These hypotheses are not mutually exclusive, but rather different theories that explain or make use of the &#8220;random nature&#8221; of spiking.</p>
<ul>
<li><span style="font-family:arial;"><span style="font-family:arial;">Carandini, M. Amplification of trial-to-trial response variability by neurons in visual cortex. <em>PLoS Biology</em>, <strong>2004</strong>, 2(9).</span></span></li>
<li><span style="font-family:arial;"><span style="font-family:arial;">Paninski, L. Maximum likelihood estimation of cascade point-process neural encoding models. <em>Network: Comput. Neural Syst., </em><strong>2004</strong><em>, 15</em>, 243-262</span><br />
</span></li>
<li><span style="font-family:arial;">Stevens, C. &amp; Zador, A. When is an Integrate-and-fire Neuron like a Poisson Neuron? <em>Advances in Neural Information Processing Systems, </em><strong>1996</strong><em>, 8</em>, 103-109</span></li>
<li><span style="font-family:arial;">Rényi, A. On an extremal property of the poisson process. <em>Annals of the Institute of Statistical Mathematics, </em><strong>1964</strong><em>, 16</em>, 129-133</span></li>
<li><span style="font-family:arial;"><span style="font-family:arial;">Toyoizumi, T.; Pfister, J.-P.; Aihara, K. &amp; Gerstner, W. Optimality Model of Unsupervised Spike-Timing-Dependent Plasticity: Synaptic Memory and Weight Distribution. <em>Neural Comp., </em><strong>2007</strong><em>, 19</em>, 639-671</span><br />
</span></li>
<li><span style="font-family:arial;"><span style="font-family:arial;">Ma, W. J.; Beck, J. M.; Latham, P. E. &amp; Pouget, A. Bayesian inference with probabilistic population codes. <em>Nature Neuroscience, Nature Publishing Group, </em><strong>2006</strong><em>, 9</em>, 1432-1438</span><br />
</span></li>
<li><span style="font-family:arial;"><span style="font-family:arial;"><span style="font-family:arial;">Fiser, J.; Berkes, P.; Orbán, G. &amp; Lengyel, M. Statistically optimal perception and learning: from behavior to neural representations. <em>Trends in Cognitive Sciences, </em><strong>2010</strong><em>, 14</em>, 119-130</span><br />
</span></span></li>
</ul>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/memming.wordpress.com/553/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/memming.wordpress.com/553/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/memming.wordpress.com/553/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/memming.wordpress.com/553/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/memming.wordpress.com/553/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/memming.wordpress.com/553/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/memming.wordpress.com/553/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/memming.wordpress.com/553/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/memming.wordpress.com/553/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/memming.wordpress.com/553/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/memming.wordpress.com/553/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/memming.wordpress.com/553/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/memming.wordpress.com/553/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/memming.wordpress.com/553/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=memming.wordpress.com&amp;blog=6664373&amp;post=553&amp;subd=memming&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://memming.wordpress.com/2011/07/14/poisson-like-noise-in-the-brain/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/aab80aeaf4754dee5d4a758db50e7213?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">memming</media:title>
		</media:content>
	</item>
		<item>
		<title>Interesting talks/posters from COSYNE 2011</title>
		<link>http://memming.wordpress.com/2011/03/08/interesting-talksposters-from-cosyne-2011/</link>
		<comments>http://memming.wordpress.com/2011/03/08/interesting-talksposters-from-cosyne-2011/#comments</comments>
		<pubDate>Tue, 08 Mar 2011 17:03:54 +0000</pubDate>
		<dc:creator>memming</dc:creator>
				<category><![CDATA[Conference]]></category>
		<category><![CDATA[Neuroscience]]></category>
		<category><![CDATA[conference]]></category>
		<category><![CDATA[cosyne]]></category>
		<category><![CDATA[neuroscience]]></category>
		<category><![CDATA[poster]]></category>

		<guid isPermaLink="false">http://memming.wordpress.com/?p=516</guid>
		<description><![CDATA[As I did for the past two CoSyNe meetings (2009, 2010), I made a summary of my personal experience of COSYNE 2011. This year, I also tried broadcasting via twitter together with hooPhilip, ag_smaoineamh, bradleyvoytek, xandram2110 (hashtag #cosyne). Jim DiCarlo gave a very nice analysis of how the reviewing process went (he injected test abstracts [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=memming.wordpress.com&amp;blog=6664373&amp;post=516&amp;subd=memming&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>As I did for the past two <a href="http://cosyne.org" target="_blank">CoSyNe</a> meetings (<a title="Interesting talks/posters from COSYNE 2009" href="http://memming.wordpress.com/2009/03/06/interesting-talksposters-from-cosyne-2009/">2009</a>, <a title="Interesting talks/posters from COSYNE 2010" href="http://memming.wordpress.com/2010/03/04/interesting-talksposters-from-cosyne-2010/">2010</a>), I made a summary of my personal experience of <a href="http://www.cosyne.org/c/index.php?title=Cosyne_11" target="_blank">COSYNE 2011</a>. This year, I also tried broadcasting via twitter together with <a href="http://twitter.com/hooPhilip" target="_blank">hooPhilip</a>, <a href="http://twitter.com/ag_smaoineamh" target="_blank">ag_smaoineamh</a>, <a href="http://twitter.com/bradleyvoytek" target="_blank">bradleyvoytek</a>, <a href="http://twitter.com/xandram2110" target="_blank">xandram2110</a> (hashtag #cosyne). Jim DiCarlo gave a very nice analysis of how the reviewing process went (he injected test abstracts to all the reviewers to normalize! he also showed what&#8217;s the expected false positive rate for the selection of talks.)</p>
<p>In my biased eyes, there were many posters advocating the <em>diversity/heterogeneity</em> of neuronal populations this year.</p>
<p><strong>II-33. John Cunningham, Mark Churchland, Matthew Kaufman, Krishna V. Shenoy. Extracting rotational structure from motor cortical data</strong><br />
John Cunningham introduced <span style="color:#000000;"><strong>jPCA</strong></span>, a simple yet powerful method that extracts the most rotational linear projection pairs from a time series. I think this may evolve into a primary visualization tool similar to PCA. The method approximates the data as a linear dynamical system where the Frobenius norm (l2 norm of the vectorized matrix) is used to minimize <img src='http://s0.wp.com/latex.php?latex=%5Cleft%5B+%5Cdot%7B%5Cmathbf%7BX%7D%7D+-+%5Cmathbf%7BM%7D+%5Cmathbf%7BX%7D+%5Cright%5D_F&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;left[ &#92;dot{&#92;mathbf{X}} - &#92;mathbf{M} &#92;mathbf{X} &#92;right]_F' title='&#92;left[ &#92;dot{&#92;mathbf{X}} - &#92;mathbf{M} &#92;mathbf{X} &#92;right]_F' class='latex' /> in the space of skew-symmetric matrices <img src='http://s0.wp.com/latex.php?latex=%5Cmathbf%7BM%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mathbf{M}' title='&#92;mathbf{M}' class='latex' />. This constraint optimization can be solved easily by a trick.<br />
From dynamical system theory, the real part of eigenvalues of <img src='http://s0.wp.com/latex.php?latex=%5Cmathbf%7BM%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mathbf{M}' title='&#92;mathbf{M}' class='latex' /> gives rise to convergence or divergence, while the imaginary part of eigenvalues indicate rotation around a fixed point. It was developed to visualize neuronal dynamics that oscillates, but this technique might also be useful for finding limit cycles, or phase synchronized population dynamics in general.</p>
<p><strong>III-32. Mark Churchland, John Cunningham, Matthew Kaufman, Stephen I. Ryu, Krishna V. Shenoy. Firing rate oscillations underlie motor cortex responses during reaching in monkey</strong><br />
This is the sister poster that uses jPCA (II-33) to analyze population dynamics of motor responses. Churchland showed that motor responses of monkey did show rotational dynamics in the later phase, and he believes that the earlier stage of motor planning is to carefully regulate the state of the system at the correct initial condition which is critical for the later dynamics.</p>
<p><strong>T-6. Tony Zador. Cortical circuits underlying auditory processing</strong><br />
He actually changed the title, and outlined an awesome plan to recover complete connectome of the brain. The plan is inspired by <a href="http://www.google.com/images?hl=en&amp;q=brainbow">brainbow</a> where neurons are colored with randomization of DNA sequences. Instead of visually coloring with fluorescent proteins, his plan is to tag them with unique DNA sequence barcodes, and use viruses to transynaptically jointly tag DNAs by combining pairs of neurons that are connected. Assuming the cost and speed of DNA sequencing technology, we would be able to obtain the full connectome of the brain with high probability pretty soon.</p>
<p><strong>T-14. Tim Vogels, Henning Sprekeler, Friedemann Zenke, Claudia Clopath, Wulfram Gerstner. Inhibitory synaptic plasticity generates global and detailed balance of excitation and inhibition</strong><br />
Inhibitory to excitatory spike timing dependent synaptic plasticity rule that could put a network of leaky integrate and fire neurons in a balanced state was presented. This is without changing the excitatory-to-excitatory synapses. In fact, they showed that you can have local excitation structure embedded that could be recalled in this network.</p>
<p><strong>I-86. Alex Huk, Miriam Meister. Neuronal heterogeneity in LIP: Banburismus in the brain, or enigma machine?</strong><br />
Alex Huk showed a collection of neurons that are responsive to memory task and/or discrimination task. The diversity of the population is huge, in terms of temporal organization. Together they support a prolonged activity that is usually seen in population averaged histograms.</p>
<p><strong>II-40. Shimazaki Hideaki, Emery N. Brown. Constructing a joint time-series model of continuous and Bernoulli/Poisson processes using a copula<br />
</strong>Copula is a powerful way of capturing dependencies between variables. There have been a several applications of copula in neural data, but it was always between a same type of data; continuous pairs or discrete pairs. Shimazaki is developing a way of capturing the dependence between continuous varaibles and a point process. So far he can capture instantaneous coupling by using an empirical conditional joint between the continuous variable and the spikes. This is possible because essentially the binary spiking can be considered a binary random variable IMHO.</p>
<p><strong>II-70. Olav Stetter, Demian Battaglia, Jordi Soriano, Theo Geisel. State-dependent network reconstruction from calcium imaging signals<br />
</strong>Olav presented their recent effort to enhance measures of causality/connectivity. They are working on analyzing the dynamics of cortical culture. He was not happy with traditional measure such as Granger causality and transfer entropy. He proposed a set of extensions: (1) high-pass filter, (2) capture instantaneous power as well, (3) use transfer entropy, (4) condition on population state. The last extension is very interesting. The network dynamics in this case has clear states:</p>
<p><strong>II-12. Bingni W Brunton, Carlos D Brody. Optimal integration of decision-making evidence in the rat<br />
</strong>This is a continuation of the work presented last year at COSYNE by the same authors. Poisson clicks were presented for a different rate on each ear for a fixed duration of 150-800 ms, and the animal (rat) has to decide which side had more clicks. From the psychophysics only, they modeled a drift to decision model with 6 parameters (leakage, sensory noise, accumulator noise, bound, facilitation/depression?). From the likelihood of the parameter space, they conclude that the rat has zero accumulator noise, which means the rats can count pretty well, and keep it in memory pretty well too.</p>
<p><strong>III-33. Wieland Brendel, Naoshige Uchida, Ranulfo Romo, Christian K Machens. How to deal with the heterogeneity of neural responses: A demixing method</strong><br />
Given a set of data that can be conditioned on several categorical conditions, the goal is to find a common set of basis vectors that is related to one of the categorical condition at a time, while explaining the variance. A variant of PCA which is somewhat related to Fisher discriminant is introduced for this purpose they call DPCA. Essentially the conditional covariance matrices (marginalizing out other variables) are computed, and the cost function for PCA is weighted by the fraction of contribution from these covariances to the total.<br />
(update Aug, 2011: Christian Machens has a <a title="Demixing population activity in higher cortical areas " href="http://dx.doi.org/10.3389/fncom.2010.00126" target="_blank">related paper</a> in Frontiers Computational Neuroscience.)</p>
<p><strong>T-3. Anmo J. Kim , Aurel A. Lazar, Yevgeniy B. Slutski. Drosophila projection neurons encode the acceleration of time-varying odor waveforms</strong><br />
Anmo showed the transfer function of early olfactory processing using precisely controlled temporal odor stimulation. Two serial stages of differentiation effect of the input was demonstrated from olfactory sensory neuron (OSN) and projection neurons (PN). Hence, the effective signal output (spike rate) of PN was the acceleration.</p>
<p><strong>II-76. Alexandra Smolyanskaya, Stephen G Lomber, Richard Born. Individual neurons in MT have significant detect probabilities for motion and depth detection tasks<br />
</strong>Using cooling, she could change tuning curve properties of visual cortex. Depth tuning and directional tuning both changed, but with different amounts. Interestingly, the performance of the neurons (measured with detection probability (same as choice probability)) degraded more for depth, if I remember correctly, while the tuning curves were worse for depth.</p>
<p><strong>I-42. Il Memming Park, Miriam Meister, Alexander Huk, Jonathan W Pillow. Detailed encoding and decoding of choice-related information from LIP spike trains</strong><br />
My poster was on the first night (hence less coverage of posters on the same day). We analyzed decision making process from a statistical modeling perspective.</p>
<p><span style="font-size:20px;font-weight:bold;">Workshops</span></p>
<p><strong>Wolf Singer. The role of oscillatory phenomena in sensory processing<br />
</strong>He was much more careful to claim the usefulness of temporal code compared to his keynote speech at IJCNN 2007. He gave various examples of dissociation between oscillation/synchrony dynamics and rate codes, but none of them was conclusive evidence that the brain utilizes them to compute. Curiosity, attention, expectancy showed modulation of oscillatory components, and he talked about <a href="http://books.nips.cc/nips19.html" target="_blank">NIPS 2006</a> paper on liquid state machine like experiment on visual cortex (Nikolic et al.).</p>
<p><strong>Aysegul Gunduz, Gerwin Schalk. The dynamics of attentional shift reflected in electrocorticography<br />
</strong>Aysegul talked about how attention modulates ECoG signal amplitudes in human, and demonstrated that it is possible to decode the spacial attention from ECoG as well. Higher frequency activity on different areas were consistently involved and formed an &#8220;attention network&#8221;.</p>
<p>On the second day of the workshop, I spent most of my time in the closing the loop: new techniques for online neural characterization and optimal control. The topics were closely related to optimal experiment design, and active learning. Greg Horwitz gave a great opening remark on how the invention of computer gave rise to laziness of the experimentalists. For example, in the old days to find V1 receptive fields, people moved a moving light bulb while listening to the neuron, but now we stimulate with a computer in an open loop most of the time. This workshop was very interesting to me. I hoped that some of the workshops on the second day would have been on the first day though.</p>
<p><strong>Greg Horwitz. Characterizing spatial iso-response surfaces of retinal ganglion cells<br />
</strong>In a neural response model where the input is combined into a single quantity, iso-response surface captures how the inputs are combined. In chromatic tuning of V1 neurons, the widely used model is linearly combining each cone output followed by a nonlinearity, which results in a linear manifold in the space of cone contrast. However, when measured adaptively through feedback to find the most non-linear part of the iso-response surface, he found some cells had more quadratic basis, or even elliptical. The adaptive strategy was to model the iso-response surface as a collection of triangles, and refine the details by probing the center of the most non-linear polygon and doing so with interleaving stimulus to avoid adaptation of the neuronal response.</p>
<p><strong>Yashar Amadian. Design of optimal stimuli to control neuronal spiking<br />
</strong>Extending last year&#8217;s poster of his, he talked about how to use a probabilistic modeling of the system (stimulus to spiking), and constraints (maximum amplitude, smoothness of waveform) to optimize for a stimulation pattern that would produce as precisely timed spike train as the target. The loss function he used for the target was an instantaneous Dirac delta function which would be zero if and only if the target spike occurs within the bin.</p>
<p><strong>Kechen Zhang. Optimal stimulus design and network structure</strong><br />
He talked about neural network theory in system identification of how constraints that bound the parameter space to a closed region in a quadratic neural network model can frequently give rise to at least one parameter on the boundary. This is because the eigenstructure of the quadratic model results in saddles of the response surface, as well as the fact that the continuous map from parameter space to response surface forcing the boundary of the parameter space to be mapped to the boundary in the constrained response surface. He also talked about how the network is ambiguous under certain conditions (power gain function and converging feedforward network).</p>
<p>See also <a title="The 8th annual computational and systems neuroscience (Cosyne) meeting " href="http://www.neuralsystemsandcircuits.com/content/1/1/8" target="_blank">Mark H. Histed and Jonathan W. Pillow&#8217;s report about the meeting</a></p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/memming.wordpress.com/516/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/memming.wordpress.com/516/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/memming.wordpress.com/516/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/memming.wordpress.com/516/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/memming.wordpress.com/516/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/memming.wordpress.com/516/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/memming.wordpress.com/516/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/memming.wordpress.com/516/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/memming.wordpress.com/516/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/memming.wordpress.com/516/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/memming.wordpress.com/516/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/memming.wordpress.com/516/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/memming.wordpress.com/516/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/memming.wordpress.com/516/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=memming.wordpress.com&amp;blog=6664373&amp;post=516&amp;subd=memming&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://memming.wordpress.com/2011/03/08/interesting-talksposters-from-cosyne-2011/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/aab80aeaf4754dee5d4a758db50e7213?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">memming</media:title>
		</media:content>
	</item>
		<item>
		<title>NIPS 2010 main conference</title>
		<link>http://memming.wordpress.com/2010/12/14/nips-2010-main-conference/</link>
		<comments>http://memming.wordpress.com/2010/12/14/nips-2010-main-conference/#comments</comments>
		<pubDate>Tue, 14 Dec 2010 18:39:49 +0000</pubDate>
		<dc:creator>memming</dc:creator>
				<category><![CDATA[Conference]]></category>
		<category><![CDATA[Event]]></category>
		<category><![CDATA[conference]]></category>
		<category><![CDATA[kernel method]]></category>
		<category><![CDATA[learning theory]]></category>
		<category><![CDATA[machine learning]]></category>
		<category><![CDATA[neuroscience]]></category>
		<category><![CDATA[NIPS]]></category>
		<category><![CDATA[semi-definite programming]]></category>
		<category><![CDATA[submodular]]></category>
		<category><![CDATA[SVM]]></category>
		<category><![CDATA[universal kernel]]></category>

		<guid isPermaLink="false">http://memming.wordpress.com/?p=471</guid>
		<description><![CDATA[This was my first NIPS (Neural Information Processing Systems). It is primarily a machine learning conference with some neural inspirations. According to the analysis done by the organizers, there were more than 1350 registered participants, 1256 submissions with 24.1% acceptance rate. Some of the keywords that have high mutual information with the acceptance are &#8216;tree-width, subgradient, [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=memming.wordpress.com&amp;blog=6664373&amp;post=471&amp;subd=memming&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>This was my first <a href="http://nips.cc" target="_blank">NIPS (Neural Information Processing Systems)</a>. It is primarily a machine learning conference with some neural inspirations. According to the analysis done by the organizers, there were more than <em>1350 registered participants, 1256 submissions with 24.1% acceptance rate</em>. Some of the keywords that have high mutual information with the acceptance are &#8216;tree-width, subgradient, concave&#8217;. From my perspective, I found non-parameteric Bayesian inference, reinforcement learning, semi-definite programming, and sparse/low-rank learning very popular along with general learning theory.</p>
<p>I summarized some of the posters that were interesting to me. If there is anything wrong, it&#8217;s my fault, and please let me know. You can find all the papers from the <a href="http://books.nips.cc/nips23.html" target="_blank">online proceedings</a>.</p>
<p><strong>Update</strong>: video recordings of the talks <a href="http://videolectures.net/nips2010_vancouver/" target="_blank">[invited talks]</a> <a href="http://videolectures.net/nips2010_oral_sessions/" target="_blank">[oral sessions]</a>.</p>
<h3>Day 1</h3>
<p><strong>Efficient algorithms for learning kernels from multiple similarity matrices with general convex loss functions</strong><br />
Achintya Kundu, vikram Tankasali, Chiranjib Bhattacharyya, Aharon Ben-Tal</p>
<p>Aiming to learn a positive semi-definite kernel matrix from a given set of similarity measures, and a classification problem, this paper uses <em>subgradient</em> with <em>mirror descent</em> optimization. They combine the SVM cost with loss function on the similarity matrices. For the mirror descent, they chose negative of matrix entropy (unnormalized entropy of eigenvalue distribution) that guarantees  the convergence.</p>
<p><strong>Generative Local Metric Learning for Nearest Neighbor Classification</strong><br />
Yung-Kyun Noh, Byoung-Tak Zhang, Daniel Lee</p>
<p>Using the asymptotic NN-classifier error bound <img src='http://s0.wp.com/latex.php?latex=%5Cint+%5Cfrac%7Bp%28x%29q%28x%29%7D%7Bp%28x%29%2Bq%28x%29%7D%5Cmathrm%7Bd%7Dx&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;int &#92;frac{p(x)q(x)}{p(x)+q(x)}&#92;mathrm{d}x' title='&#92;int &#92;frac{p(x)q(x)}{p(x)+q(x)}&#92;mathrm{d}x' class='latex' /> which is measure-independent, the goal is to learn a global Euclidean metric scaling for <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BR%7D%5ED&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mathbb{R}^D' title='&#92;mathbb{R}^D' class='latex' />. The critical problem is that the theoretically measure-independent quantity becomes measure-dependent when estimated from finite samples. By first order expansion of the probability estimated by the nearest neighbor estimator, the authors explicitly derive the bias, and tries to learn the metric while reducing the bias. It is posed as a semi-definite programming (SDP).</p>
<p><strong>Estimation of Rényi Entropy and Mutual Information Based on Generalized Nearest-Neighbor Graphs</strong><br />
David Pal, Barnabas Poczos, Csaba Szepesvari</p>
<p>There have been works on Rényi&#8217;s entropy estimation from nearest-neighbor, but this one proves strong convergence results. In addition, they show that given a consistent estimator for Rényi&#8217;s entropy, one can use empirical copula to obtain a consistent estimator for Renyi&#8217;s mutual information. The fact that the marginals are uniform make it very simple.</p>
<p><strong>Random Conic Pursuit for Semidefinite Programming</strong><br />
Ariel Kleiner, Ali Rahimi, Michael Jordan</p>
<p>According to the authors, interior point method for SDP is slow and tends to jump to the solution instead of slowly converging. As an alternative, they propose a stochastic gradient descent, where for every iteration a random rank 1 positive definite matrix is chosen, and the optimization is solved for the 2-dimensional solution plane instead of the entire cone of semi-definite matrices.</p>
<p><strong>Short-term memory in neuronal networks through dynamical compressed sensing</strong><br />
Surya Ganguli, Haim Sompolinsky</p>
<p>When sparse signal is provided as a single common input to a network with linear recurrent dynamics, the problem of recovering past input signals from current activation pattern can be formulated as an compressed sensing problem. They built a theory of memory depth under this setting where the recurrent dynamics matrix is orthogonal. The assumptions are not very realistic, however, viewing the problem from a compressive sensing point of view connects to the <a href="http://www.cnel.ufl.edu/research/BMI.php" target="_blank">FWIRE project</a>.</p>
<h3>Day 2</h3>
<p><strong>Machine Learning with HumanIntelligence: Principled Corner Cutting (PC2) &#8211; invited talk</strong><br />
Xiao Li Meng</p>
<p>This was an inspiring invited talk with a practical philosophy. Self-consistency principle as a general method to deal with missing data was fresh to me. Compromising statistical principles for more efficient methods that resembles machine learning principles were discussed with personal experiences. <a href="http://videolectures.net/nips2010_meng_mlhi/" target="_blank">[video]</a></p>
<p><strong>Identifying Dendritic Processing</strong><br />
<a title="Aurel A. Lazar" href="http://www.ee.columbia.edu/~aurel/" target="_blank"> Aurel A. Lazar</a>, Yevgeniy Slutskiy</p>
<p>Extending the time encoding machine and perfect reconstruction techniques, the authors propose estimating a linear filter followed by a non-noisy integrate-and-fire neuron. Interestingly, due to causality the dendritic process (modeled as a linear filter) has infinite spectrum which conflicts with the (RKHS) theory of band limited functions required for the time encoding machine theory. As a result, they can only recover the filter projected in the band limited space. Moreover, using a clever trick of switching the input and the filter, multiple presentations of different input can be combined to estimate the filter even under the usual Nyquist spiking rate limit.</p>
<p><strong>Near-Optimal Bayesian Active Learning with Noisy Observations</strong><br />
Daniel Golovin, <a href="http://www.cs.caltech.edu/~krausea/" target="_blank">Andreas Krause</a>, Debajyoti Ray</p>
<p>Determining the optimal hypothesis (within a finite set) with a few observation is a problem of active learning. Counter intuitively, the choice that provides maximal information gain do NOT perform well, and in pathological problems the posterior does not eliminate choices. Instead they show the choice that reduces the posterior probability mass the most is adaptive submodular, and works under noisy observations. The main contributions are efficient algorithms that implement such policy.</p>
<p><strong>Robust PCA via Outlier Pursuit</strong><br />
Huan Xu, Constantine Caramanis, <a href="http://users.ece.utexas.edu/~sanghavi/" target="_blank">Sujay Sanghavi</a></p>
<p>When PCA is formulated as SVD on the data matrix, outliers are defined as corrupted columns that do not lie in the low dimensional space. Using a combination of nuclear norm (sum of singular values) and L1 norm of the column-wise L2 norm, they propose a convex optimization formulation. The proposed method has deterministic error bound related to the incoherence parameter.</p>
<p><strong>Global Analytic Solution for Variational Bayesian Matrix Factorization<br />
</strong>Shinichi Nakajima, Masashi Sugiyama, Ryota Tomioka</p>
<p>The problem of SVD is posed as a generative model. Gaussian model for a low rank matrix is assumed, and solved with analytical solutions using variational Bayes approach of factorizing the posterior of columns of the low rank matrix. Without assistance of symbolic mathematical tools such as mathematica, the authors arrived at analytic solutions with flat prior as well as ARD prior. (The full equations for the solutions were too lengthy to be included in the poster.)</p>
<p><strong>Improvements to the Sequence Memoizer</strong><br />
Jan Gasthaus and Yee Whye Teh</p>
<p>Using hierarchical Pitman-Yor process (a &#8220;heavy-tail&#8221; version of Dirchlet process with extra parameters) as the prior, the aim is to learn a distribution of a symbol conditioned on all previous symbols. An application is compression where the inference process is replicated in the decoding end with same random seed. I&#8217;m new to non-parameteric Bayesian inference, so the details slipped through me, but I would definitely revisit this paper.</p>
<p><strong>Global seismic monitoring as probabilistic inference</strong><br />
<a href="http://www.cs.berkeley.edu/~nimar/" target="_blank">Nimar S. Arora</a>, Stuart Russell, Paul Kidwell, Erik Sudderth</p>
<p>From a collection of seismic sensors, they aim to find a probable cause (artificial nuclear activity). Using prior knowledge combined with a heuristic Bayesian inference algorithm, they infer a sequence of events (with position and amplitude) as causes. The inference is done through operations on a Poisson process initialized sequence as a hypothesis and improve it via adding, aligning, reassigning, and pruning; simiar to dependent DP process by Dahua Lin (below). I think a similar process can be used to infer hidden causes of spike train observations in a generative point process modeling.</p>
<h3>Day 3</h3>
<p><strong><a href="http://dahuasky.wordpress.com/2010/12/02/nips-outstanding-student-paper-award/" target="_blank">Construction of Dependent Dirichlet Processes based on Poisson Processes</a></strong><a href="http://dahuasky.wordpress.com/2010/12/02/nips-outstanding-student-paper-award/" target="_blank"><br />
</a><a href="http://people.csail.mit.edu/dhlin/" target="_blank"> Dahua Lin</a>, Eric Grimson, <a href="http://people.csail.mit.edu/fisher/" target="_blank">John Fisher</a></p>
<p>This paper got the best student paper award. Dirichlet process is related to a compound Poisson process known as <em>Gamma process</em>; Gamma process takes positive values independently distributed as Gamma distribution at each event generated by a Poisson process. Dirichlet process is simply a normalized form of Gamma process, such that the &#8220;amplitude&#8221; of Gamma process at each point represents the probability of being at that position. It is well known that Poisson process is the only completely random process, hence one can perform operations such as thinning, superposition, and random translation to create a similar Poisson process. The authors create dependent Dirichlet process through these operations and apply it to non-stationary clustering problem by developing Gibbs sampling for it.</p>
<p><strong>A Novel Kernel for Learning a Neuron Model from Spike Train Data<br />
</strong>Nicholas K. Fisher, <a href="http://www.cise.ufl.edu/~arunava/" target="_blank">Arunava Banerjee</a></p>
<p>Neuron modeling is posed as a classification problem for input spike trains that do not elicit an action potential vs. that do. SVM is used with conjunction to a novel spike train kernel of the form <img src='http://s0.wp.com/latex.php?latex=%5Csum_i+%5Csum_j+%5Cfrac%7Bt%5E1_i+t%5E2_j%7D%7Bt%5E1_i+%2B+t%5E2_j%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_i &#92;sum_j &#92;frac{t^1_i t^2_j}{t^1_i + t^2_j}' title='&#92;sum_i &#92;sum_j &#92;frac{t^1_i t^2_j}{t^1_i + t^2_j}' class='latex' />. This was derived from a combination of basis functions <img src='http://s0.wp.com/latex.php?latex=f_t+%3D+%5Cfrac%7B1%7D%7B%5Ctau%7D+%5Cexp%28-%5Cbeta+%2F+t%29+%5Cexp%28-t%2F%5Ctau%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f_t = &#92;frac{1}{&#92;tau} &#92;exp(-&#92;beta / t) &#92;exp(-t/&#92;tau)' title='f_t = &#92;frac{1}{&#92;tau} &#92;exp(-&#92;beta / t) &#92;exp(-t/&#92;tau)' class='latex' /> for each pair of spike timings they call REEF (reciprocal exponential &#8211; exponential function). The kernel is the uniform integration of outer product of REEF. This is a novel binless spike train positive definite kernel that weights the spikes in time non-uniformly (like spikernel). Here they applied to learning SRM<sub>0</sub> (simplest spike response model), but I think this has potential to be applied to arbitrary neuron models.</p>
<p><strong>Universal Kernels on Non-Standard Input Spaces<br />
</strong><a href="http://www.stoch.uni-bayreuth.de/de/CHRISTMANN/" target="_blank">Andreas Christmann</a>, <a href="http://www.isa.uni-stuttgart.de/Steinwart/index.t?lang=en" target="_blank">Ingo Steinwart</a></p>
<p>Universal kernels have nice convergence properties and learning guarantees (on a compact set); Guassian kernel and Laplacian kernels are widely used because of this property. The authors show that one can construct universal kernels from a simple form <img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bn%3D0%7D%5E%5Cinfty+a_n+t%5En&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_{n=0}^&#92;infty a_n t^n' title='&#92;sum_{n=0}^&#92;infty a_n t^n' class='latex' />. When <img src='http://s0.wp.com/latex.php?latex=a_n+%3E+0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a_n &gt; 0' title='a_n &gt; 0' class='latex' />, then <img src='http://s0.wp.com/latex.php?latex=k%28x%2Cy%29+%3D+%5Csum_%7Bn%3D0%7D%5E%5Cinfty+%3Cx%7Cy%3E%5En_%7Bl_2%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='k(x,y) = &#92;sum_{n=0}^&#92;infty &lt;x|y&gt;^n_{l_2}' title='k(x,y) = &#92;sum_{n=0}^&#92;infty &lt;x|y&gt;^n_{l_2}' class='latex' /> is a universal kernel in a closed ball. Extending this result, if one can find a continuous injective mapping to a separable Hilbert space, by Mercer theorem and the above result, one can build a universal kernel from any compact space. Two interesting examples of kernels on probability space are given; one for characteristic kernel and embedded probability, and the other is using (generalized) characteristic functions of probabilities.</p>
<p>Of course <a title="Extended KS and CM test for point processes" href="http://memming.wordpress.com/2010/12/05/extended-ks-and-cm-test-for-point-processes/">my poster</a> was on the third day as well, so I had little time for other&#8217;s posters. (Update: check out <a href="http://nipsposterface.com/" target="_blank">http://nipsposterface.com/</a>)</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/memming.wordpress.com/471/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/memming.wordpress.com/471/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/memming.wordpress.com/471/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/memming.wordpress.com/471/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/memming.wordpress.com/471/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/memming.wordpress.com/471/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/memming.wordpress.com/471/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/memming.wordpress.com/471/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/memming.wordpress.com/471/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/memming.wordpress.com/471/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/memming.wordpress.com/471/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/memming.wordpress.com/471/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/memming.wordpress.com/471/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/memming.wordpress.com/471/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=memming.wordpress.com&amp;blog=6664373&amp;post=471&amp;subd=memming&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://memming.wordpress.com/2010/12/14/nips-2010-main-conference/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/aab80aeaf4754dee5d4a758db50e7213?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">memming</media:title>
		</media:content>
	</item>
		<item>
		<title>Extended KS and CM test for point processes</title>
		<link>http://memming.wordpress.com/2010/12/05/extended-ks-and-cm-test-for-point-processes/</link>
		<comments>http://memming.wordpress.com/2010/12/05/extended-ks-and-cm-test-for-point-processes/#comments</comments>
		<pubDate>Sun, 05 Dec 2010 13:47:00 +0000</pubDate>
		<dc:creator>memming</dc:creator>
				<category><![CDATA[Conference]]></category>
		<category><![CDATA[Poster]]></category>
		<category><![CDATA[CM test]]></category>
		<category><![CDATA[divergence]]></category>
		<category><![CDATA[iocane]]></category>
		<category><![CDATA[KS test]]></category>
		<category><![CDATA[NIPS]]></category>
		<category><![CDATA[point process]]></category>
		<category><![CDATA[sensory prosthesis]]></category>

		<guid isPermaLink="false">http://memming.wordpress.com/?p=452</guid>
		<description><![CDATA[Traditionally in neuroscience, information in spike trains is thought to be in the total number of spikes (within a small interval) — known as the rate code. Other than the number of spikes, the precise pattern of action potentials are often discarded. Hence, in many applications where changes in the signal the brain encodes is expected [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=memming.wordpress.com&amp;blog=6664373&amp;post=452&amp;subd=memming&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Traditionally in neuroscience, information in spike trains is thought to be in the total number of spikes (within a small interval) — known as the rate code. Other than the number of spikes, the precise pattern of action potentials are often discarded. Hence, in many applications where changes in the signal the brain encodes is expected to change, neuroscientists first check if the mean rate of their observed spike train is changing. Detecting the change in the mean firing rate has been successful in virtually every area of neuroscience including sensory, working memory, decision making, motor systems. However, not much attention is given to the information that is included in the precise firing pattern. Information theoretic analysis and other temporal code analyses are the currently widely used options for probing such information. But there have been a lack of tools for <strong>detecting arbitrary differences in spike train observations under different conditions</strong>. Assuming no prior knowledge, using <a href="http://en.wikipedia.org/wiki/Divergence_(statistics)" target="_blank">statistical divergences</a> is a natural choice of statistic for this problem. In real valued random variables, Kullback-Leibler divergence, <a title="Significance test for spike trains based on finite point process estimation" href="http://memming.wordpress.com/2009/10/15/significance-test-for-spike-trains-based-on-finite-point-process-estimation/" target="_blank">Hellinger-divergence</a>, or total variation would theoretically serve one&#8217;s purpose, however, they are not always easy to estimate. On the other hand, simpler non-parameteric test statistics such as <a href="http://en.wikipedia.org/wiki/KS_test#Two-sample_Kolmogorov.E2.80.93Smirnov_test" target="_blank">Kolmogorov-Smirnov (KS) test statistic</a>, or <a href="http://en.wikipedia.org/wiki/Cram%C3%A9r%E2%80%93von_Mises_criterion" target="_blank">Cramér-von-Mises (CM) test statistic</a> are very easy to estimate and powerful enough for many cases. Both KS and CM test statistics can be applied to discriminate the difference between arbitrary distributions. On the other hand, KS test requires the underlying space to have a full ordering, while CM test statistic requires square integrability of difference of distribution functions. Hence it is non-trivial to extend it to the point process domain where such structures do not exist naturally.</p>
<p><a href="http://nips.cc/Conferences/2010/" target="_blank">NIPS 2010</a> starts tomorrow, and we have a poster for the <a title="Parameter free divergences for point processes" href="http://memming.wordpress.com/2010/01/26/parameter-free-divergences-for-point-processes/" target="_blank">previously mentioned</a> extension of KS and CM test as point process divergences I have developed with <a href="http://sites.google.com/site/sohanseth/" target="_blank">Sohan Seth</a>. You can glimpse at the poster below, but as you can see it is fairly compressed. So if you are attending NIPS 2010, please stop by to our poster and we&#8217;ll be glad to explain the details (<a href="http://nips.cc/Conferences/2010/Program/event.php?ID=2149" target="_blank">Wednesday Dec 8th, W59</a>, &#8220;A novel family of non-parametric cumulative based divergences for point processes&#8221;).</p>
<p><a href="http://memming.files.wordpress.com/2010/12/nips_2010_poster_memming_v3.png"><img class="aligncenter size-full wp-image-453" title="NIPS 2010 poster" src="http://memming.files.wordpress.com/2010/12/nips_2010_poster_memming_v3.png?w=600&#038;h=384" alt="" width="600" height="384" /></a></p>
<p>One application of a point process divergence is <a href="http://en.wikipedia.org/wiki/Neuroprosthetics">sensory neuralprothesis</a>. The goal of sensory prosthetics is to artificially recreate the activation and inactivation of the brain so that a percept of sensory stimulation would be formed. The current approach is to electrically stimulate the thalamus along the sensory pathway. However, given only a limited number of electrodes, it is difficult to exactly mimic the natural pattern of firing the peripheral nerves. Therefore, a search of stimulation parameter is necessary to find the best possible setting. <a title="Austin J. Brockmeier" href="http://sites.google.com/site/ajbrockmeier/">Austin Brockmeier</a> analyzed a set of stimulation patterns to VPL of anesthetized rat (from <a href="http://joefrancislab.com/">Francis Lab</a>) and demonstrated the usefulness and advantage of the proposed divergences compared to mean rate. I am very pleased to see the method working in practice.</p>
<p>The implementation is straightforward, and included in the <a href="http://memming.wordpress.com/tag/iocane/" target="_blank">IOCANE</a> <a href="http://code.google.com/p/iocane" target="_blank">open source project</a>.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/memming.wordpress.com/452/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/memming.wordpress.com/452/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/memming.wordpress.com/452/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/memming.wordpress.com/452/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/memming.wordpress.com/452/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/memming.wordpress.com/452/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/memming.wordpress.com/452/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/memming.wordpress.com/452/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/memming.wordpress.com/452/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/memming.wordpress.com/452/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/memming.wordpress.com/452/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/memming.wordpress.com/452/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/memming.wordpress.com/452/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/memming.wordpress.com/452/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=memming.wordpress.com&amp;blog=6664373&amp;post=452&amp;subd=memming&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://memming.wordpress.com/2010/12/05/extended-ks-and-cm-test-for-point-processes/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/aab80aeaf4754dee5d4a758db50e7213?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">memming</media:title>
		</media:content>

		<media:content url="http://memming.files.wordpress.com/2010/12/nips_2010_poster_memming_v3.png" medium="image">
			<media:title type="html">NIPS 2010 poster</media:title>
		</media:content>
	</item>
		<item>
		<title>Integration of product of functions on a probability simplex</title>
		<link>http://memming.wordpress.com/2010/09/09/integration-of-product-of-functions-on-a-probability-simplex/</link>
		<comments>http://memming.wordpress.com/2010/09/09/integration-of-product-of-functions-on-a-probability-simplex/#comments</comments>
		<pubDate>Thu, 09 Sep 2010 16:39:27 +0000</pubDate>
		<dc:creator>memming</dc:creator>
				<category><![CDATA[Mathematics]]></category>
		<category><![CDATA[tips and tricks]]></category>

		<guid isPermaLink="false">http://memming.wordpress.com/?p=413</guid>
		<description><![CDATA[I saw this trick in Wolpert and Wolf 1995, and I thought I would share it. Given a product of functions over a probability mass function where K is the cardinality of alphabets, and real valued function , we want to compute the integral of this over the probability simplex ( are non-negative and sum [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=memming.wordpress.com&amp;blog=6664373&amp;post=413&amp;subd=memming&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>I saw this trick in Wolpert and Wolf 1995, and I thought I would share it.</p>
<p>Given a product of functions <img src='http://s0.wp.com/latex.php?latex=%5Cprod_%7Bi%3D1%7D%5EK+h_i%28p_i%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;prod_{i=1}^K h_i(p_i)' title='&#92;prod_{i=1}^K h_i(p_i)' class='latex' /> over a probability mass function <img src='http://s0.wp.com/latex.php?latex=%5Cleft%5C%7B+p_i+%5Cright%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;left&#92;{ p_i &#92;right&#92;}' title='&#92;left&#92;{ p_i &#92;right&#92;}' class='latex' /> where K is the cardinality of alphabets, and real valued function <img src='http://s0.wp.com/latex.php?latex=h_i%28%5Ccdot%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='h_i(&#92;cdot)' title='h_i(&#92;cdot)' class='latex' />, we want to compute the integral of this over the <a href="http://en.wikipedia.org/wiki/Simplex#Probability">probability simplex</a> (<img src='http://s0.wp.com/latex.php?latex=p_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p_i' title='p_i' class='latex' /> are non-negative and sum up to 1). The trick is to realize that the integral can be written in convolutions, and use the <a href="http://planetmath.org/encyclopedia/LaplaceTransformOfConvolution.html">convolution theorem for Laplace transform</a>.</p>
<p><img src='http://s0.wp.com/latex.php?latex=%5Cint_0%5E1+%5Cint_0%5E%7B1+-+p_1%7D+%5Ccdots+%5Cint_0%5E%7B1-%5Csum_%7Bi%3D1%7D%5E%7BK-1%7D+p_i%7D+%5Cprod+h%28p_i%29+dp_1+dp_2+%5Ccdots+dp_%7BK-1%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;int_0^1 &#92;int_0^{1 - p_1} &#92;cdots &#92;int_0^{1-&#92;sum_{i=1}^{K-1} p_i} &#92;prod h(p_i) dp_1 dp_2 &#92;cdots dp_{K-1}' title='&#92;int_0^1 &#92;int_0^{1 - p_1} &#92;cdots &#92;int_0^{1-&#92;sum_{i=1}^{K-1} p_i} &#92;prod h(p_i) dp_1 dp_2 &#92;cdots dp_{K-1}' class='latex' /></p>
<p>Note that <img src='http://s0.wp.com/latex.php?latex=p_K&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p_K' title='p_K' class='latex' /> is not integrated, since it is determined by the rest of the variables. Define <img src='http://s0.wp.com/latex.php?latex=%5Ctau_j+%3D+1+-+%5Csum_%7Bi%3D1%7D%5E%7Bj%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;tau_j = 1 - &#92;sum_{i=1}^{j}' title='&#92;tau_j = 1 - &#92;sum_{i=1}^{j}' class='latex' />. Note that <img src='http://s0.wp.com/latex.php?latex=p_j+%3D+%5Ctau_%7Bj-1%7D+-+p_%7Bj-1%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p_j = &#92;tau_{j-1} - p_{j-1}' title='p_j = &#92;tau_{j-1} - p_{j-1}' class='latex' />. Rewrite the integral as,</p>
<p><img src='http://s0.wp.com/latex.php?latex=%5Cint_0%5E1+%5Cint_0%5E%7B%5Ctau_1%7D+%5Ccdots+%5Cint_0%5E%7B%5Ctau_%7BK-2%7D%7D+%5Cprod_%7Bi%3D1%7D%5E%7BK-2%7D+h_i%28p_i%29+%5Cint_0%5E%7B%5Ctau_%7BK-1%7D%7D+h_%7BK-1%7D%28p_%7BK-1%7D%29+h_%7BK%7D%28%5Ctau_%7BK-1%7D+-+q_%7BK-1%7D%29+dp_%7BK-1%7D+dp_1+dp_2+%5Ccdots+dp_%7BK-2%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;int_0^1 &#92;int_0^{&#92;tau_1} &#92;cdots &#92;int_0^{&#92;tau_{K-2}} &#92;prod_{i=1}^{K-2} h_i(p_i) &#92;int_0^{&#92;tau_{K-1}} h_{K-1}(p_{K-1}) h_{K}(&#92;tau_{K-1} - q_{K-1}) dp_{K-1} dp_1 dp_2 &#92;cdots dp_{K-2}' title='&#92;int_0^1 &#92;int_0^{&#92;tau_1} &#92;cdots &#92;int_0^{&#92;tau_{K-2}} &#92;prod_{i=1}^{K-2} h_i(p_i) &#92;int_0^{&#92;tau_{K-1}} h_{K-1}(p_{K-1}) h_{K}(&#92;tau_{K-1} - q_{K-1}) dp_{K-1} dp_1 dp_2 &#92;cdots dp_{K-2}' class='latex' /></p>
<p>Recognizing the convolution, it can be simplified as,</p>
<p><img src='http://s0.wp.com/latex.php?latex=%5Cint_0%5E1+%5Cint_0%5E%7B%5Ctau_1%7D+%5Ccdots+%5Cint_0%5E%7B%5Ctau_%7BK-2%7D%7D+%5Cprod_%7Bi%3D1%7D%5E%7BK-2%7D+h_i%28p_i%29+%28h_%7BK-1%7D+%5Cotimes+h_%7BK%7D%29%28%5Ctau_%7BK-1%7D%29+dp_1+dp_2+%5Ccdots+dp_%7BK-2%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;int_0^1 &#92;int_0^{&#92;tau_1} &#92;cdots &#92;int_0^{&#92;tau_{K-2}} &#92;prod_{i=1}^{K-2} h_i(p_i) (h_{K-1} &#92;otimes h_{K})(&#92;tau_{K-1}) dp_1 dp_2 &#92;cdots dp_{K-2}' title='&#92;int_0^1 &#92;int_0^{&#92;tau_1} &#92;cdots &#92;int_0^{&#92;tau_{K-2}} &#92;prod_{i=1}^{K-2} h_i(p_i) (h_{K-1} &#92;otimes h_{K})(&#92;tau_{K-1}) dp_1 dp_2 &#92;cdots dp_{K-2}' class='latex' /></p>
<p>Using <img src='http://s0.wp.com/latex.php?latex=%5Ctau_%7Bj%7D+%3D+%5Ctau_%7Bj-1%7D+-+p_%7Bj-1%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;tau_{j} = &#92;tau_{j-1} - p_{j-1}' title='&#92;tau_{j} = &#92;tau_{j-1} - p_{j-1}' class='latex' />,</p>
<div id="_mcePaste"><img src='http://s0.wp.com/latex.php?latex=%5Cint_0%5E1+%5Cint_0%5E%7B%5Ctau_1%7D+%5Ccdots+%5Cint_0%5E%7B%5Ctau_%7BK-3%7D%7D+%5Cprod_%7Bi%3D1%7D%5E%7BK-3%7D+h_i%28p_i%29+%5Cint_0%5E%7B%5Ctau_%7BK-2%7D%7D+h_%7BK-2%7D%28p_%7BK-2%7D%29%28h_%7BK-1%7D+%5Cotimes+h_%7BK%7D%29%28%5Ctau_%7BK-2%7D+-+p_%7BK-2%7D%29+dp_%7BK-2%7D+dp_1+dp_2+%5Ccdots+dp_%7BK-3%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;int_0^1 &#92;int_0^{&#92;tau_1} &#92;cdots &#92;int_0^{&#92;tau_{K-3}} &#92;prod_{i=1}^{K-3} h_i(p_i) &#92;int_0^{&#92;tau_{K-2}} h_{K-2}(p_{K-2})(h_{K-1} &#92;otimes h_{K})(&#92;tau_{K-2} - p_{K-2}) dp_{K-2} dp_1 dp_2 &#92;cdots dp_{K-3}' title='&#92;int_0^1 &#92;int_0^{&#92;tau_1} &#92;cdots &#92;int_0^{&#92;tau_{K-3}} &#92;prod_{i=1}^{K-3} h_i(p_i) &#92;int_0^{&#92;tau_{K-2}} h_{K-2}(p_{K-2})(h_{K-1} &#92;otimes h_{K})(&#92;tau_{K-2} - p_{K-2}) dp_{K-2} dp_1 dp_2 &#92;cdots dp_{K-3}' class='latex' /></div>
<div></div>
<div id="_mcePaste">Again, the convolution form is there. By induction, we end up with,</div>
<p><img src='http://s0.wp.com/latex.php?latex=%5Cleft%28+%5Cotimes_%7Bi%3D1%7D%5E%7BK%7D+h_i+%5Cright%29%281%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;left( &#92;otimes_{i=1}^{K} h_i &#92;right)(1)' title='&#92;left( &#92;otimes_{i=1}^{K} h_i &#92;right)(1)' class='latex' /></p>
<p>Now, just apply the Laplace transform, and you are good to go!<br />
This trick is useful for Bayesian integrals on Dirichlet prior.</p>
<p>Reference</p>
<ul>
<li> David Wolpert, David Wolf. <a href="http://dx.doi.org/10.1103/PhysRevE.52.6841">Estimating functions of probability distributions from a finite set of samples</a>. Physical Review E, Vol. 52, No. 6. (December 1995), pp. 6841-6854.</li>
</ul>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/memming.wordpress.com/413/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/memming.wordpress.com/413/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/memming.wordpress.com/413/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/memming.wordpress.com/413/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/memming.wordpress.com/413/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/memming.wordpress.com/413/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/memming.wordpress.com/413/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/memming.wordpress.com/413/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/memming.wordpress.com/413/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/memming.wordpress.com/413/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/memming.wordpress.com/413/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/memming.wordpress.com/413/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/memming.wordpress.com/413/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/memming.wordpress.com/413/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=memming.wordpress.com&amp;blog=6664373&amp;post=413&amp;subd=memming&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://memming.wordpress.com/2010/09/09/integration-of-product-of-functions-on-a-probability-simplex/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/aab80aeaf4754dee5d4a758db50e7213?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">memming</media:title>
		</media:content>
	</item>
		<item>
		<title>Removing ensemble mean introduces correlation</title>
		<link>http://memming.wordpress.com/2010/08/31/removing-ensemble-mean-introduces-correlation/</link>
		<comments>http://memming.wordpress.com/2010/08/31/removing-ensemble-mean-introduces-correlation/#comments</comments>
		<pubDate>Tue, 31 Aug 2010 23:03:01 +0000</pubDate>
		<dc:creator>memming</dc:creator>
				<category><![CDATA[Science and Engineering]]></category>
		<category><![CDATA[data analysis]]></category>
		<category><![CDATA[MATLAB code]]></category>
		<category><![CDATA[signal processing]]></category>

		<guid isPermaLink="false">http://memming.wordpress.com/?p=387</guid>
		<description><![CDATA[Ensemble mean removal is often performed in multi-variate time series analysis, when one suspects global instantaneous fluctuation of signal is additively introduced, and wants to remove it. For example, if a time series of images are provided as the signal, there may be additional uncontrolled light source mixed with the intended signal, and it is [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=memming.wordpress.com&amp;blog=6664373&amp;post=387&amp;subd=memming&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Ensemble mean removal is often performed in multi-variate time series analysis, when one suspects global instantaneous fluctuation of signal is additively introduced, and wants to remove it. For example, if a time series of images are provided as the signal, there may be additional uncontrolled light source mixed with the intended signal, and it is desirable to remove this effect. Ensemble mean removal in the simplest case can be simply done by taking the instantaneous temporal mean, and subtracting it from each channel. When the gain of each channel is assumed to be heterogeneous (but with same sign), one can still take the ensemble mean and compute the optimal gain for each channel. When taking the ensemble mean, the assumptions is that the time-locked common component to signal ratio increases, so given enough channels (at least more than 10).</p>
<p>Note that if <img src='http://s0.wp.com/latex.php?latex=Y_i%28t%29+%3D+X_i%28t%29+%2B+%5Calpha_i+%5Ceta%28t%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='Y_i(t) = X_i(t) + &#92;alpha_i &#92;eta(t)' title='Y_i(t) = X_i(t) + &#92;alpha_i &#92;eta(t)' class='latex' /> where <img src='http://s0.wp.com/latex.php?latex=Y_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='Y_i' title='Y_i' class='latex' /> is the observation from channel <em>i</em>, <img src='http://s0.wp.com/latex.php?latex=X_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X_i' title='X_i' class='latex' /> is a zero mean random process that is spatially independent, and <img src='http://s0.wp.com/latex.php?latex=%5Ceta%28t%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;eta(t)' title='&#92;eta(t)' class='latex' /> is the fixed realization of a zero mean random process with variance <img src='http://s0.wp.com/latex.php?latex=%5Csigma%5E2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sigma^2' title='&#92;sigma^2' class='latex' /> fluctuation among the channels, the instantaneous cross-correlation is <img src='http://s0.wp.com/latex.php?latex=E%5Cleft%5BY_i+Y_j%5Cright%5D+%3D+%5Calpha_i+%5Calpha_j+%5Csigma%5E2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='E&#92;left[Y_i Y_j&#92;right] = &#92;alpha_i &#92;alpha_j &#92;sigma^2' title='E&#92;left[Y_i Y_j&#92;right] = &#92;alpha_i &#92;alpha_j &#92;sigma^2' class='latex' />. Hence if <img src='http://s0.wp.com/latex.php?latex=%5Calpha_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;alpha_i' title='&#92;alpha_i' class='latex' /> are all positive (or all negative), such common additive fluctuation creates positive correlation.</p>
<p>However, if one analyzes the cross-correlation between channels after ensemble mean removal, one would find that there is a tendency that the cross-correlation at zero lag is smaller (often negative) than expected. In essence, this is kind of a small sample size effect. The problem is even when the <img src='http://s0.wp.com/latex.php?latex=%5Calpha_i+%5Ceta%28t%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;alpha_i &#92;eta(t)' title='&#92;alpha_i &#92;eta(t)' class='latex' /> term is perfectly removed by the ensemble average subtraction, the empirical mean <img src='http://s0.wp.com/latex.php?latex=%5Csum_i+X_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_i X_i' title='&#92;sum_i X_i' class='latex' /> is not zero, and again we are removing this from each channel as well. This can be demonstrated in the simple case where the underlying process are independent and have the same variance, then</p>
<p style="padding-left:30px;"><img src='http://s0.wp.com/latex.php?latex=E%5Cleft%5B+%5Cleft%28X_i+-+%5Cfrac%7B1%7D%7BN%7D+%5Csum_i+X_i%5Cright%29+%5Cleft%28X_j+-+%5Cfrac%7B1%7D%7BN%7D+%5Csum_i+X_i%5Cright%29+%5Cright%5D+%3D+-%5Cfrac%7BE%5BX_i%5E2%5D%7D%7BN%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='E&#92;left[ &#92;left(X_i - &#92;frac{1}{N} &#92;sum_i X_i&#92;right) &#92;left(X_j - &#92;frac{1}{N} &#92;sum_i X_i&#92;right) &#92;right] = -&#92;frac{E[X_i^2]}{N}' title='E&#92;left[ &#92;left(X_i - &#92;frac{1}{N} &#92;sum_i X_i&#92;right) &#92;left(X_j - &#92;frac{1}{N} &#92;sum_i X_i&#92;right) &#92;right] = -&#92;frac{E[X_i^2]}{N}' class='latex' />.</p>
<p>Therefore ensemble mean introduces negative correlation. The following MATLAB code demonstrates it.</p>
<div id="attachment_399" class="wp-caption alignright" style="width: 310px"><a href="http://memming.files.wordpress.com/2010/08/xcorr_demo.png"><img class="size-medium wp-image-399  " title="Cross correlations" src="http://memming.files.wordpress.com/2010/08/xcorr_demo.png?w=300&#038;h=225" alt="Demonstration of ensemble mean removal to cross-correlation" width="300" height="225" /></a><p class="wp-caption-text">The blue line corresponds to the independent signal, the green the common fluctuation added, and the red the ensemble removed.</p></div>
<p><code>N = 1000; % length of time series<br />
M = 20; % number of channels<br />
eta = 0.1 * randn(N,1);<br />
X = randn(N,M);<br />
Y = X + repmat(eta,1,M);<br />
esbm = mean(Y,2); % compute the ensemble mean<br />
Z = Y - repmat(esbm,1,M);<br />
figure; hold all;<br />
[mcc,lags] = meanXcorr(X,10); plot(lags, mcc);<br />
[mcc,lags] = meanXcorr(Y,10); plot(lags, mcc);<br />
[mcc,lags] = meanXcorr(Z,10); plot(lags, mcc);<br />
legend('X', 'Y', 'Z');</code><br />
where the function meanXcor is simply computes the mean pairwise cross-correlation.<br />
<code>function [mcc, lags] = meanXcorr(X, maxLag)<br />
k = 0;<br />
for n1 = 1:size(X,2)<br />
for n2 = n1+1:size(X,2)<br />
k = k + 1;<br />
[cc(:,k), lags] = xcorr(X(:,n1), X(:,n2), maxLag, 'coeff');<br />
end<br />
end<br />
mcc = mean(cc,2);</code></p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/memming.wordpress.com/387/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/memming.wordpress.com/387/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/memming.wordpress.com/387/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/memming.wordpress.com/387/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/memming.wordpress.com/387/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/memming.wordpress.com/387/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/memming.wordpress.com/387/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/memming.wordpress.com/387/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/memming.wordpress.com/387/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/memming.wordpress.com/387/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/memming.wordpress.com/387/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/memming.wordpress.com/387/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/memming.wordpress.com/387/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/memming.wordpress.com/387/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=memming.wordpress.com&amp;blog=6664373&amp;post=387&amp;subd=memming&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://memming.wordpress.com/2010/08/31/removing-ensemble-mean-introduces-correlation/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/aab80aeaf4754dee5d4a758db50e7213?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">memming</media:title>
		</media:content>

		<media:content url="http://memming.files.wordpress.com/2010/08/xcorr_demo.png?w=300" medium="image">
			<media:title type="html">Cross correlations</media:title>
		</media:content>
	</item>
		<item>
		<title>An engineering introduction to measure theory</title>
		<link>http://memming.wordpress.com/2010/07/18/an-engineering-introduction-to-measure-theory/</link>
		<comments>http://memming.wordpress.com/2010/07/18/an-engineering-introduction-to-measure-theory/#comments</comments>
		<pubDate>Sun, 18 Jul 2010 21:28:40 +0000</pubDate>
		<dc:creator>memming</dc:creator>
				<category><![CDATA[Mathematics]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[measure theory]]></category>

		<guid isPermaLink="false">http://memming.wordpress.com/?p=361</guid>
		<description><![CDATA[A measure is a convenient mathematical object (function) that can represent the positions of strawberries in a field, distribution of water in the ocean, or probabilities of winning over the lottery numbers — the measure counts the number of strawberries in a given area, reports the amount of water in a certain sea, and evaluates the probability [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=memming.wordpress.com&amp;blog=6664373&amp;post=361&amp;subd=memming&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>A measure is a convenient mathematical object (function) that can represent the positions of strawberries in a field, distribution of water in the ocean, or probabilities of winning over the lottery numbers — the measure counts the number of strawberries in a given area, reports the amount of water in a certain sea, and evaluates the probability of a lottery ticket to win. This abstract unifying framework enables one to rigorously &#8216;measure&#8217; quantities over a space, and also enable integration. It also allows elegant notation for probability theory. Here we briefly describe key ideas of <a href="http://en.wikipedia.org/wiki/Measure_(mathematics)">measure theory</a> without proof. This material is mostly based on references [1] and [2].</p>
<p>To define a measure, we need a <strong>measurable space</strong> (X, F); a non-empty set X and a σ-algebra F on X. Here X is the space where our stuff to be measured lies, and F gives special structure of the space such that things are well defined and <a href="http://en.wikipedia.org/wiki/Non-measurable_set">pathological sets</a> can be avoided. An <strong>algebra</strong> F of  X is a set of subsets of X such that it contains the empty set, and closed under set union and complement. A <strong>σ-algebra</strong> is an algebra that is closed under countable union. Elements in F are said to be <strong>measurable</strong>.</p>
<p>A <strong>measure</strong> μ on (X, F) is a non-negative extended real valued function on F that is countably additive;</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cmu+%5Cleft%28+%5Ccup_i%5E%5Cinfty+A_i+%5Cright%29+%3D+%5Csum_i%5E%5Cinfty+%5Cmu+%28A_i%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mu &#92;left( &#92;cup_i^&#92;infty A_i &#92;right) = &#92;sum_i^&#92;infty &#92;mu (A_i)' title='&#92;mu &#92;left( &#92;cup_i^&#92;infty A_i &#92;right) = &#92;sum_i^&#92;infty &#92;mu (A_i)' class='latex' /></p>
<p>where <img src='http://s0.wp.com/latex.php?latex=A_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A_i' title='A_i' class='latex' /> are disjoint measurable sets. Additivity makes sense because we want <img src='http://s0.wp.com/latex.php?latex=%5Cmu%28A%29+%2B+%5Cmu%28B%29+%3D+%5Cmu%28A+%5Ccup+B%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mu(A) + &#92;mu(B) = &#92;mu(A &#92;cup B)' title='&#92;mu(A) + &#92;mu(B) = &#92;mu(A &#92;cup B)' class='latex' /> when A and B are disjoint (number of strawberries should add up for different fields). If μ is always finite valued, then it is called a <strong>finite measure</strong>. For a special case, a probability measure is a finite measure where μ(X) = 1.</p>
<p>Given a set E, we denote the smallest σ-algebra that contains E as <strong>σ(E)</strong> and say σ(E) is the σ-algebra generated by E. A measure μ on F is determined by its values on any algebra that generates F (<a href="http://en.wikipedia.org/wiki/Carath%C3%A9odory's_extension_theorem">Carathéodory extension theorem</a>).</p>
<p>A predicate P(x) holds <strong>almost everywhere μ</strong> (or <strong>μ-a.e.</strong> for short) if it is true except for a set of measure zero, that is, <img src='http://s0.wp.com/latex.php?latex=%5Cmu%28E%29+%3D+0%2C+%5Cforall+x+%5Cin+E%5E%7Bc%7D%2C+P%28x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mu(E) = 0, &#92;forall x &#92;in E^{c}, P(x)' title='&#92;mu(E) = 0, &#92;forall x &#92;in E^{c}, P(x)' class='latex' />.</p>
<p>A function f from a measurable space (X, F) to a measurable space (Y, G) is <strong>measurable</strong> if <img src='http://s0.wp.com/latex.php?latex=%5Cforall+E+%5Cin+%5Cmathcal%7BG%7D%2C+f%5E%7B-1%7D%28E%29+%5Cin+%5Cmathcal%7BF%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;forall E &#92;in &#92;mathcal{G}, f^{-1}(E) &#92;in &#92;mathcal{F}' title='&#92;forall E &#92;in &#92;mathcal{G}, f^{-1}(E) &#92;in &#92;mathcal{F}' class='latex' />. <span style="font-size:13.1944px;">For a topological space (X, U), the σ-algebra generated by the open sets is called the <strong>Borel σ-algebra</strong> (or <a href="http://en.wikipedia.org/wiki/Borel_set">Borel set</a>). Borel set links the measurable functions and continuous functions — every continuous function from X to the real line is measurable with respect to the Borel algebra. W</span><span style="font-size:13.1944px;">hen the topological space is induced by a metric, Borel set and Baire set coincide. <strong>Baire set</strong> is the smallest σ-algebra with respect to which the continuous functions are measurable.</span></p>
<p>Real-valued Borel-measurable functions are closed under algebraic operations and limit. Moreover, they can be approximated by limit of simple functions. A simple function is a finite linear combination of indicator function of measurable sets. By the linearity of integration, integration of a Borel-measurable function with respect to a measure μ can be defined by letting the integration of an indicator function on the measurable set A as μ(A): <img src='http://s0.wp.com/latex.php?latex=%5Cint+%5Cmathbb%7BI%7D_%7BA%7D+%5Cmathrm%7Bd%7D%5Cmu+%3D+%5Cmu%28A%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;int &#92;mathbb{I}_{A} &#92;mathrm{d}&#92;mu = &#92;mu(A)' title='&#92;int &#92;mathbb{I}_{A} &#92;mathrm{d}&#92;mu = &#92;mu(A)' class='latex' />.</p>
<p>References:</p>
<ol>
<li><span style="font-family:arial;">Halmos, P. R. Measure theory. Springer-Verlag<em>, </em>1974</span></li>
<li><span style="font-family:arial;">Daley, D. J. &amp; Vere-Jones, D. An Introduction to the Theory of Point Processes. Springer, 1988</span></li>
</ol>
<p>P.S. Terry Tao has a <a href="http://terrytao.wordpress.com/2009/01/01/245b-notes-0-a-quick-review-of-measure-and-integration-theory/">great summary on measure theory and integration</a>.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/memming.wordpress.com/361/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/memming.wordpress.com/361/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/memming.wordpress.com/361/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/memming.wordpress.com/361/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/memming.wordpress.com/361/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/memming.wordpress.com/361/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/memming.wordpress.com/361/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/memming.wordpress.com/361/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/memming.wordpress.com/361/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/memming.wordpress.com/361/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/memming.wordpress.com/361/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/memming.wordpress.com/361/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/memming.wordpress.com/361/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/memming.wordpress.com/361/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=memming.wordpress.com&amp;blog=6664373&amp;post=361&amp;subd=memming&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://memming.wordpress.com/2010/07/18/an-engineering-introduction-to-measure-theory/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/aab80aeaf4754dee5d4a758db50e7213?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">memming</media:title>
		</media:content>
	</item>
		<item>
		<title>Higher-order Spike Synchrony</title>
		<link>http://memming.wordpress.com/2010/05/19/higher-order-spike-synchrony/</link>
		<comments>http://memming.wordpress.com/2010/05/19/higher-order-spike-synchrony/#comments</comments>
		<pubDate>Wed, 19 May 2010 15:43:21 +0000</pubDate>
		<dc:creator>memming</dc:creator>
				<category><![CDATA[Research]]></category>
		<category><![CDATA[CuBIC]]></category>
		<category><![CDATA[higher-order interaction]]></category>
		<category><![CDATA[neural assembly]]></category>
		<category><![CDATA[synchrony]]></category>

		<guid isPermaLink="false">http://memming.wordpress.com/?p=341</guid>
		<description><![CDATA[My recent visit to Sonja Grün&#8217;s lab made me thinking about statistical measures that would detect higher-order mutual interactions among neurons that would not be explained by lower-order interaction. If there are neural assemblies that work together, the detection of such mutually dependent neurons would be a key evidence. For synchrony (the simplest type of [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=memming.wordpress.com&amp;blog=6664373&amp;post=341&amp;subd=memming&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>My recent visit to <a href="http://www.cnpsn.brain.riken.jp/home/Main_Page">Sonja Grün&#8217;s lab</a> made me thinking about statistical measures that would detect higher-order mutual interactions among neurons that would not be explained by lower-order interaction. If there are neural assemblies that work together, the detection of such mutually dependent neurons would be a key evidence. For synchrony (the simplest type of dependence/interaction between spiking neurons) the difference between pairwise and triple synchrony can be illustrated by the following spike trains:</p>
<p><a href="http://memming.files.wordpress.com/2010/05/spike_trains_pairwise_synchronous1.png"><img class="aligncenter size-medium wp-image-344" title="spike_trains_pairwise_synchronous" src="http://memming.files.wordpress.com/2010/05/spike_trains_pairwise_synchronous1.png?w=300&#038;h=162" alt="" width="300" height="162" /></a></p>
<p>Spike train pairs(A, B) and (B, C) are synchronous as they have synchronized action potential timings (red and blue, respectively). However, A, B, C are not synchronous all together in this example. Hence, there&#8217;s a need for capturing this higher-order synchrony structure.</p>
<p>Even when the time window of analysis is small enough, such that in a single trial basis, synchrony in pairs (A,B) and (B,C) implies synchrony in (A,C) because the jitter allowed for synchrony detection is in the order of window size, pairwise synchrony does NOT imply higher-order synchrony because in some trials (A,B) may be synchronous and some other trials (B,C) may be synchronous.</p>
<p>Detection of mutual interaction of multiple neurons is a difficult problem, and we know pairwise analysis tools can only check necessary conditions but not sufficient. (Simple example: given 3 random variables, X, Y, Z, pairwise independence does not imply mutual independence, since P(X,Y,Z) = P(X)P(Y)P(Z) may not hold.)  Nevertheless most analysis are done with pairwise analysis these days. I was impressed by the work by Sonja&#8217;s group on CuBIC method to detect the presence of these higher order synchrony [1]. The main idea is that amplitude distribution of the population rate, they call &#8216;<em>complexity distribution</em>&#8216;, has a heavier tail when the higher order synchrony is present. The idea does not require a combinatorial search, and can be directly applied without spike sorting. The disadvantages are that the assumptions may not hold in real data, and only the presence is detected and the actual interaction is not directly captured.</p>
<p>References</p>
<ol>
<li>Benjamin Staude, Stefan Rotter, Sonja Grün. CuBIC: cumulant based inference of higher-order correlations in massively parallel spike trains. Journal of Computational Neuroscience (28 October 2009) <a href="http://dx.doi.org/10.1007/s10827-009-0195-x">dx.doi.org/10.1007/s10827-009-0195-x</a></li>
</ol>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/memming.wordpress.com/341/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/memming.wordpress.com/341/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/memming.wordpress.com/341/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/memming.wordpress.com/341/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/memming.wordpress.com/341/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/memming.wordpress.com/341/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/memming.wordpress.com/341/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/memming.wordpress.com/341/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/memming.wordpress.com/341/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/memming.wordpress.com/341/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/memming.wordpress.com/341/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/memming.wordpress.com/341/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/memming.wordpress.com/341/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/memming.wordpress.com/341/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=memming.wordpress.com&amp;blog=6664373&amp;post=341&amp;subd=memming&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://memming.wordpress.com/2010/05/19/higher-order-spike-synchrony/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/aab80aeaf4754dee5d4a758db50e7213?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">memming</media:title>
		</media:content>

		<media:content url="http://memming.files.wordpress.com/2010/05/spike_trains_pairwise_synchronous1.png?w=300" medium="image">
			<media:title type="html">spike_trains_pairwise_synchronous</media:title>
		</media:content>
	</item>
	</channel>
</rss>
