Digital Signal Processing Reference
In-Depth Information
technologies. Clearly, this chapter is only meant to capture the landscape of the
field that is still young and still evolving. For a long-term perspective, video scene
analysis is an interesting issue that currently requires for a lot more research efforts.
Acknowledgement The work is supported by grants from the Chinese National Natural Science
Foundation under contract No. 60973055 and No. 61035001, and National Basic Research Pro-
gram of China under contract No. 2009CB320906.
References
1. S. Aksoy, K. Koperski, C. Tusk, G. Marchisio, and J.C. Tilton, “Learning Bayesian classifiers
for scene classification with a visual grammar,” IEEE Trans. Geoscience and Remote Sensing,
vol. 43, no. 3, pp. 581-589, 2005.
2. Y. Altun, I. Tsochantaridis, and T. Hofman, “Hidden Markov support vector machines,” in
Proc. IEEE Int. Conf. Mechine Learning, 2003, pp. 3-10.
3. K. Barnard, P. Duygulu, N. de Freitas, D. Forsyth, D. Blei, and M. I. Jordan, “Matching words
and pictures,” J. Machine Learning Research, vol 3, pp. 1107-1135, 2003.
4. S. Boyd and L. Vandenberghe. Convex Optimization. Cambridge University Press, 2004.
5. N. D. Bruce and J. K. Tsotsos. Saliency based on information maximization. In Advances in
neural information processing systems, pp. 155-162, 2006.
6. M. Cerf, J. Harel, W. Einhauser, and C. Koch, Predicting human gaze using low-level saliency
combined with face detection, in Advances in Neural Information Processing Systems, 2008,
pp. 241-248.
7. Dai, J., Duan, L., Tong, X., Xu, C., Tian, Q., Lu, H., and Jin, J. 2005. Replay scene classification
in soccer video using web broadcast text. In Proc. IEEE ICME. 1098-1101.
8. L. Duan, I.W. Tsang, D. Xu, and S.J. Maybank, “Domain transfer SVM for video concept
detection,” in Proc. IEEE Int. Conf. Computer Vision and Pattern Recognition, 2009, pp. 1-8.
9. S. Ebadollahi, L. Xie, S.-F., Chang, and J.R. Smith, “Visual event detection using multidimen-
sional concept dynamics,” in Proc. IEEE Int. Conf. Multimedia and Expo, 2006, pp. 881-884.
10. C. Frith. The top in top-down attention. In Neurobiology of attention (pp. 105-108), 2005.
11. Wen Gao, Yonghong Tian, Tiejun Huang, Qiang Yang. Vlogging: A Survey of Video Blogging
Technology on the Web. ACM Computing Survey, 2(4), Jun. 2010.
12. Gunawardana, A., Mahajan, M., Acero, A., and Platt, J. 2005. Hidden conditional random
fields for phone classification. In Proc. Interspeech. 1117-1120.
13. C. Guo, Q. Ma, and L. Zhang, Spatio-temporal saliency detection using phase spectrum of
quaternion fourier transform, in IEEE Conference on Computer Vision and Pattern Recogni-
tion, 2008.
14. J. S. Hare, P. H. Lewis, P. G. B. Enser and C. J. Sandom, “Mind the Gap: Another look at the
problem of the semantic gap in image retrieval,” Multimedia Content Analysis, Management
and Retrieval 2006, vol. 6073, No. 1, 2006, San Jose, CA, USA.
15. J. Harel, C. Koch, and P. Perona, Graph-based visual saliency, in Advances in Neural Informa-
tion Processing Systems, 2007, pp. 545-552.
16. X. Hou and L. Zhang, Saliency detection: A spectral residual approach, in IEEE Conference
on Computer Vision and Pattern Recognition, 2007.
17. H. Hsu, L. Kennedy, and S. F. Chang, “Video search reranking through random walk over
document-level context graph,” in Proc. ACM Multimedia, 2007, pp. 971-980.
18. Y. Hu, D. Rajan, and L.-T. Chia, Robust subspace analysis for detecting visual attention regions
in images, in ACM International Conference on Multimedia, 2005, pp. 716-724.
19. L. Itti and C. Koch, Computational modeling of visual attention, Nature Review Neuroscience,
vol. 2, no. 3, pp. 194-203, 2001.
Search WWH ::




Custom Search