From: Iris.Adam on
HI everybody...
I'm brand new to this community and also new to DSP, and this is my first
time asking questions in any community so forgive me for my long questions,
I have some questions I appreciate if you could help me.

I calculated MFCC but I think it is incorrect i'll explain the way i did
it so if you can tell me if it is correct or not,and if not how to fix it.
Actually I don't understand it well, although i read many articles about
it. In the steps after pre-emphasizing the data, for example windoing, fft
etc. i take it for each block like this

- pre-emphasize all the signal
- for each overlapped frame do
take hamming window, and put result in array (h)
take FFT for the data in (h)
find magnitude of the fft data
apply triangular window to the magnitude data
find summation of the previous step
find log of the summation
apply DCT to the log data (this will be the cepstrum)
next overlapped frame
- calculate Nth coefficient by summing Nth value of each frame
- then find Delta and DeltaDelta by these two equations
DeltaC(k)= C(k+2) - C(k-2)
DeltaDelta(k) = C(k+1) - C(k-1)

Is this correct or just I have to apply the hamming to the whole data
instead of each frame and then calculate Fourier?
Please help.......

I have some other questions:
1- How to decide number of triangular windows in the mel scale step, it
depends on what?

2- the equation of mel scale
mel= 2595 + log(f/ 7000 + 1)
where will be used, in the articles I read i didn't understand when
to use it, is it used before triangular windowing?
3- In calculating triangular windows, when says if m[f-1] > k and k <
m[f]
what is k? is it the frequency values(ones we obtained from fft) or it

is just the iteration control variable?
because when i multiply this to the magnitude value and find the
summation I get negative values, which means its wrong
4- is there any way to ensure that the results are correct
5- this one is about NN. how to determine number of inputs to the NN ( I
use back-propagation NN)


From: Jerry Avins on
Iris.Adam wrote:
> HI everybody...
> I'm brand new to this community and also new to DSP, and this is my first
> time asking questions in any community so forgive me for my long questions,
> I have some questions I appreciate if you could help me.
>
> I calculated MFCC ...

You left something out. What is MFCC? Read
http://users.erols.com/jyavins/procfaq.htm

Jerry
--
Engineering is the art of making what you want from things you can get.
�����������������������������������������������������������������������
From: Richard Dobson on
On 24/01/2010 14:25, Jerry Avins wrote:
> Iris.Adam wrote:
>> HI everybody...
>> I'm brand new to this community and also new to DSP, and this is my first
>> time asking questions in any community so forgive me for my long
>> questions,
>> I have some questions I appreciate if you could help me.
>> I calculated MFCC ...
>
> You left something out. What is MFCC? Read
> http://users.erols.com/jyavins/procfaq.htm
>
> Jerry

Mel-Frequency Cepstral Coefficients. Form of filtering. Used in speech
recognition especially, but also in sound description and classification
generally, e.g. MPEG-7 'n stuff. It has been discussed before here.
Google is... etc.

Seems to me the OP's question ticks all the relevant boxes in that faq;
but the faq doesn't address the issue of what dsp topics/acronyms must
first be explained to the group (again).

That said, there may well be more people on the musicdsp list who have
practical experience of using MFCCs.


Richard Dobson