Video Scene Analysis: A Machine Learning Perspective - Video Segmentation and Its Applications

Digital Signal Processing Reference

In-Depth Information

technologies. Clearly, this chapter is only meant to capture the landscape of the

field that is still young and still evolving. For a long-term perspective, video scene

analysis is an interesting issue that currently requires for a lot more research efforts.

Acknowledgement The work is supported by grants from the Chinese National Natural Science

Foundation under contract No. 60973055 and No. 61035001, and National Basic Research Pro-

gram of China under contract No. 2009CB320906.

References

1. S. Aksoy, K. Koperski, C. Tusk, G. Marchisio, and J.C. Tilton, “Learning Bayesian classifiers

for scene classification with a visual grammar,” IEEE Trans. Geoscience and Remote Sensing,

vol. 43, no. 3, pp. 581-589, 2005.

2. Y. Altun, I. Tsochantaridis, and T. Hofman, “Hidden Markov support vector machines,” in

Proc. IEEE Int. Conf. Mechine Learning, 2003, pp. 3-10.

3. K. Barnard, P. Duygulu, N. de Freitas, D. Forsyth, D. Blei, and M. I. Jordan, “Matching words

and pictures,” J. Machine Learning Research, vol 3, pp. 1107-1135, 2003.

4. S. Boyd and L. Vandenberghe. Convex Optimization. Cambridge University Press, 2004.

5. N. D. Bruce and J. K. Tsotsos. Saliency based on information maximization. In Advances in

neural information processing systems, pp. 155-162, 2006.

6. M. Cerf, J. Harel, W. Einhauser, and C. Koch, Predicting human gaze using low-level saliency

combined with face detection, in Advances in Neural Information Processing Systems, 2008,

pp. 241-248.

7. Dai, J., Duan, L., Tong, X., Xu, C., Tian, Q., Lu, H., and Jin, J. 2005. Replay scene classification

in soccer video using web broadcast text. In Proc. IEEE ICME. 1098-1101.

8. L. Duan, I.W. Tsang, D. Xu, and S.J. Maybank, “Domain transfer SVM for video concept

detection,” in Proc. IEEE Int. Conf. Computer Vision and Pattern Recognition, 2009, pp. 1-8.

9. S. Ebadollahi, L. Xie, S.-F., Chang, and J.R. Smith, “Visual event detection using multidimen-

sional concept dynamics,” in Proc. IEEE Int. Conf. Multimedia and Expo, 2006, pp. 881-884.

10. C. Frith. The top in top-down attention. In Neurobiology of attention (pp. 105-108), 2005.

11. Wen Gao, Yonghong Tian, Tiejun Huang, Qiang Yang. Vlogging: A Survey of Video Blogging

Technology on the Web. ACM Computing Survey, 2(4), Jun. 2010.

12. Gunawardana, A., Mahajan, M., Acero, A., and Platt, J. 2005. Hidden conditional random

fields for phone classification. In Proc. Interspeech. 1117-1120.

13. C. Guo, Q. Ma, and L. Zhang, Spatio-temporal saliency detection using phase spectrum of

quaternion fourier transform, in IEEE Conference on Computer Vision and Pattern Recogni-

tion, 2008.

14. J. S. Hare, P. H. Lewis, P. G. B. Enser and C. J. Sandom, “Mind the Gap: Another look at the

problem of the semantic gap in image retrieval,” Multimedia Content Analysis, Management

and Retrieval 2006, vol. 6073, No. 1, 2006, San Jose, CA, USA.

15. J. Harel, C. Koch, and P. Perona, Graph-based visual saliency, in Advances in Neural Informa-

tion Processing Systems, 2007, pp. 545-552.

16. X. Hou and L. Zhang, Saliency detection: A spectral residual approach, in IEEE Conference

on Computer Vision and Pattern Recognition, 2007.

17. H. Hsu, L. Kennedy, and S. F. Chang, “Video search reranking through random walk over

document-level context graph,” in Proc. ACM Multimedia, 2007, pp. 971-980.

18. Y. Hu, D. Rajan, and L.-T. Chia, Robust subspace analysis for detecting visual attention regions

in images, in ACM International Conference on Multimedia, 2005, pp. 716-724.

19. L. Itti and C. Koch, Computational modeling of visual attention, Nature Review Neuroscience,

vol. 2, no. 3, pp. 194-203, 2001.

Video Segmentation and Its Applications

Search WWH ::

Custom Search

Home