From dmorton at bitfurnace.com Sat Apr 26 12:04:16 2014
From: dmorton at bitfurnace.com (Damien Morton)
Date: Sat, 26 Apr 2014 15:04:16 -0400
Subject: [FoRK] Kernel Density Technique (Parzen Window) help
Message-ID:
Hi everyone,
I am trying to implement a technique described in the following paper,
http://www.tcs-trddc.com/trddc_website/pdf/SRL/Palshikar_SAPDTS_2009.pdf
I have struck a problem and am looking for some help.
on page 7, function S4 is defined
Part of that is function pw(ai)
The right hand term inside the summation has a denominator |a[i] - a[i+w]|
The kernel density technique (also called Parzen window) is being used
The problem is that the denominator is 0 when a[i]==a[i+w] - I want to know
how best to handle this event
now, I have searched all kinds of material on the kernel density technique
and nowhere do i see the denominator defined this way
from what I can see, the denominator is usually expressed as a constant
proportional to the standard deviation of the samples
I _think_ the term |a[i] - a[i+w]| is being used as some kind of adaptive
kernel width. Its really not clear to me why this term is being used and
what it means
I tried contacting the author, but got no response
If anyone can point me in the right direction or give me some advice, t
would be very much appreciated
From sdw at lig.net Sun Apr 27 11:58:13 2014
From: sdw at lig.net (Stephen D. Williams)
Date: Sun, 27 Apr 2014 11:58:13 -0700
Subject: [FoRK] Kernel Density Technique (Parzen Window) help
In-Reply-To:
References:
Message-ID: <535D5345.5000002@lig.net>
The Wand and Jones (1995) must refer to this:
https://encrypted.google.com/books?id=GTOOi5yE008C&q=parzen+window#v=onepage&q=parzen%20window&f=false
Page 44 of that book, in the free Google view of it, mentions the use of the mean value theorem although that appears to be just one
step.
https://en.wikipedia.org/wiki/Mean_value_theorem
It seems like the most likely meaning of p sub(w) (a sub(i) ) ... |a sub(i) - a sub(i+w)|, is that w=window. It isn't clear what
was meant by the denominator. Doesn't seem workable as is.
Not only does the following explain kernel density estimation well, but it points to a numpy function that computes with a Gaussian
kernel and determines the bandwidth (i.e. w) automatically:
https://en.wikipedia.org/wiki/Kernel_density_estimation
And, rug plot!
See also:
http://research.cs.tamu.edu/prism/lectures/pr/pr_l7.pdf
https://www.cs.utah.edu/~suyash/Dissertation_html/node11.html
See section 4 here. Also good illustration of different kernels.
http://arxiv.org/pdf/1212.2812.pdf
Looks interesting for both machine learning and image cleanup / compression / understanding. Although most of those methods already
have much more efficient ways of solving an equivalent problem. You might want to look at both machine learning and image
processing algorithms to do "peak detection in time-series".
Really interesting:
https://en.wikipedia.org/wiki/Manifold_learning#Manifold_learning_algorithms
http://iihm.imag.fr/daassi/papierInfoVisJournal/papierExemple/9500051a.pdf
http://people.csail.mit.edu/rosman/tcie_ijcv.pdf
http://tx.technion.ac.il/~rc/diffusion_maps.pdf
http://shogun-toolbox.org/
General purpose machine learning toolkit:
http://waffles.sourceforge.net/
sdw
On 4/26/14, 12:04 PM, Damien Morton wrote:
> Hi everyone,
>
> I am trying to implement a technique described in the following paper,
>
> http://www.tcs-trddc.com/trddc_website/pdf/SRL/Palshikar_SAPDTS_2009.pdf
>
> I have struck a problem and am looking for some help.
>
> on page 7, function S4 is defined
>
> Part of that is function pw(ai)
>
> The right hand term inside the summation has a denominator |a[i] - a[i+w]|
>
> The kernel density technique (also called Parzen window) is being used
>
> The problem is that the denominator is 0 when a[i]==a[i+w] - I want to know
> how best to handle this event
>
> now, I have searched all kinds of material on the kernel density technique
> and nowhere do i see the denominator defined this way
>
> from what I can see, the denominator is usually expressed as a constant
> proportional to the standard deviation of the samples
>
> I _think_ the term |a[i] - a[i+w]| is being used as some kind of adaptive
> kernel width. Its really not clear to me why this term is being used and
> what it means
>
> I tried contacting the author, but got no response
>
> If anyone can point me in the right direction or give me some advice, t
> would be very much appreciated
From wkearney99 at hotmail.com Sun Apr 27 14:14:08 2014
From: wkearney99 at hotmail.com (Bill Kearney)
Date: Sun, 27 Apr 2014 17:14:08 -0400
Subject: [FoRK] O2 seats?
Message-ID:
I'm debating going to the Monty Python show at the O2 in London. Anyone
have opinions on what seating areas to get/avoid?
-Bill Kearney
From dmorton at bitfurnace.com Sun Apr 27 19:59:48 2014
From: dmorton at bitfurnace.com (Damien Morton)
Date: Sun, 27 Apr 2014 22:59:48 -0400
Subject: [FoRK] Kernel Density Technique (Parzen Window) help
In-Reply-To: <535D5345.5000002@lig.net>
References:
<535D5345.5000002@lig.net>
Message-ID:
Hi Stephen,
Thanks for the pointer to The Wand and Jones book.
It appears that the denominator is indeed an attempt at a adaptive kernel
width.
On page 43 they describe variable kernel estimators, and the denominator
from my paper corresponds to Wan and Jone's alpha(Xi) term.
"... suggests that each alpha(Xi) term should depend on the true density in
a roughly inverse way"
It appears that the |Xi - Xi+w| term is a crude attempt to estimate the
density around Xi
I had started to experiment with calculating the variance of the values in
the Xi....i+w window, and using that as the denominator, but I didnt
understand what I was doing. Now I have some basis to continue that.
Plus, the book also talks about clipping for extreme values, and then says
its out of the scope of the book. Harumph.
On Sun, Apr 27, 2014 at 2:58 PM, Stephen D. Williams wrote:
> The Wand and Jones (1995) must refer to this:
> https://encrypted.google.com/books?id=GTOOi5yE008C&q=
> parzen+window#v=onepage&q=parzen%20window&f=false
>
> Page 44 of that book, in the free Google view of it, mentions the use of
> the mean value theorem although that appears to be just one step.
> https://en.wikipedia.org/wiki/Mean_value_theorem
>
> It seems like the most likely meaning of p sub(w) (a sub(i) ) ... |a
> sub(i) - a sub(i+w)|, is that w=window. It isn't clear what was meant by
> the denominator. Doesn't seem workable as is.
>
> Not only does the following explain kernel density estimation well, but it
> points to a numpy function that computes with a Gaussian kernel and
> determines the bandwidth (i.e. w) automatically:
> https://en.wikipedia.org/wiki/Kernel_density_estimation
> And, rug plot!
>
> See also:
> http://research.cs.tamu.edu/prism/lectures/pr/pr_l7.pdf
> https://www.cs.utah.edu/~suyash/Dissertation_html/node11.html
>
> See section 4 here. Also good illustration of different kernels.
> http://arxiv.org/pdf/1212.2812.pdf
>
> Looks interesting for both machine learning and image cleanup /
> compression / understanding. Although most of those methods already have
> much more efficient ways of solving an equivalent problem. You might want
> to look at both machine learning and image processing algorithms to do
> "peak detection in time-series".
>
> Really interesting:
> https://en.wikipedia.org/wiki/Manifold_learning#Manifold_
> learning_algorithms
>
> http://iihm.imag.fr/daassi/papierInfoVisJournal/papierExemple/9500051a.pdf
> http://people.csail.mit.edu/rosman/tcie_ijcv.pdf
> http://tx.technion.ac.il/~rc/diffusion_maps.pdf
>
> http://shogun-toolbox.org/
>
> General purpose machine learning toolkit:
> http://waffles.sourceforge.net/
>
> sdw
>
>
> On 4/26/14, 12:04 PM, Damien Morton wrote:
>
>> Hi everyone,
>>
>> I am trying to implement a technique described in the following paper,
>>
>> http://www.tcs-trddc.com/trddc_website/pdf/SRL/Palshikar_SAPDTS_2009.pdf
>>
>> I have struck a problem and am looking for some help.
>>
>> on page 7, function S4 is defined
>>
>> Part of that is function pw(ai)
>>
>> The right hand term inside the summation has a denominator |a[i] - a[i+w]|
>>
>> The kernel density technique (also called Parzen window) is being used
>>
>> The problem is that the denominator is 0 when a[i]==a[i+w] - I want to
>> know
>> how best to handle this event
>>
>> now, I have searched all kinds of material on the kernel density
>> technique
>> and nowhere do i see the denominator defined this way
>>
>> from what I can see, the denominator is usually expressed as a constant
>> proportional to the standard deviation of the samples
>>
>> I _think_ the term |a[i] - a[i+w]| is being used as some kind of
>> adaptive
>> kernel width. Its really not clear to me why this term is being used and
>> what it means
>>
>> I tried contacting the author, but got no response
>>
>> If anyone can point me in the right direction or give me some advice, t
>> would be very much appreciated
>>
>
>
> _______________________________________________
> FoRK mailing list
> http://xent.com/mailman/listinfo/fork
>