2007年5月8日 星期二

Annotation Difficulty

We face some problems when we do manual annotation. Should we label
  1. music/non-music?
  2. singing/non-singing?
  3. music with singing/music without sining/no music
  4. singing with music/singing without music/no singing
  5. ...... something else
One of a reason why we're doubting is because we don't know "two adaBoost" or "one MultiBoost" would be better to identify A, B, and C. Note: the A here may include B, and vice versa.

Also, another question is, how can we handle the no-vocal part between two sentences? Is it music with singing or pure-music? Would it influence our training results?

2 則留言:

vagante 提到...

we handle all these questions in the way as below:

1 pure music
2 music with singing
3 ambiguous
4 nonmusic

and if two singing sequences are separated too far, then we separate them with a pure music sequence, or a ambiguous one, which depends on cases.

winston 提到...

so you are doing multiple-class discrimination. You can simply use the "One-against-all" approach as I mentioned in the lecture. SVM or Adaboost either one is OK for building such discriminative models.