default search action
ICME 2003: Baltimore, MD, USA
- Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, ICME 2003, 6-9 July 2003, Baltimore, MD, USA. IEEE Computer Society 2003, ISBN 0-7803-7965-9
Volume 1
Networked Video I
- Thinh P. Q. Nguyen, Puneet Mehra, Avideh Zakhor:
Path diversity and bandwidth allocation for multimedia streaming. 1-4 - Susie J. Wee, John G. Apostolopoulos, Wai-tian Tan, Sumit Roy:
Research and design of a mobile streaming media content delivery network. 5-8 - Jacob Chakareski, Eric Setton, Yi J. Liang, Bernd Girod:
Video streaming with diversity. 9-12 - Marco Fumagalli, Phoom Sagetong, Antonio Ortega:
Estimation of erased data in a H.263 coded stream by using unbalanced multiple description coding. 13-16 - Amy R. Reibman, Vinay A. Vaishampayan:
Quality monitoring for compressed video subjected to packet loss. 17-20
Automatic Indexing
- Rémi Ronfard, Tien Tran-Thuong:
A framework for aligning and indexing movies with their script. 21-24 - Xiaofei He, Wei-Ying Ma, Hong-Jiang Zhang:
Imagerank: spectral techniques for structural analysis of image database. 25-28 - Adam Berenzweig, Daniel P. W. Ellis, Steve Lawrence:
Anchor space for classification and similarity measurement of music. 29-32 - Tong Zhang:
Automatic singer identification. 33-36 - Matthew R. Boutell, Jiebo Luo, Robert T. Gray:
Sunset scene classification using simulated image recomposition. 37-40
Multimodal Interfaces
- Yeow Kee Tan, Nasser Sherkat, Tony Allen:
Eye gaze and speech for data entry: a comparison of different data entry methods. 41-44 - Yasuhito Sawahata, Kiyoharu Aizawa:
Wearable imaging system for summarizing personal experiences. 45-48 - Timothy T. H. Chen, Sidney S. Fels, Saehee Sarah Min:
FlowField and beyond: applying pressure-sensitive multi-point touchpad interaction. 49-52 - Xin Fan, Xing Xie, Wei-Ying Ma, Hong-Jiang Zhang, He-Qin Zhou:
Visual attention based image browsing on mobile devices. 53-56 - Björn W. Schuller, Martin Zobl, Gerhard Rigoll, Manfred K. Lang:
A hybrid music retrieval system using belief networks to integrate multimodal queries and contextual knowledge. 57-60
Speech and Audio Processing I
- Hsuan-Huei Shih, Shrikanth S. Narayanan, C.-C. Jay Kuo:
A statistical multidimensional humming transcription using phone level hidden Markov models for query by humming systems. 61-64 - Rongshan Yu, Xiao Lin, Susanto Rahardja, Chi Chung Ko:
A fine granular scalable perceptually lossy and lossless audio coder. 65-68 - Simon Lucey, Tsuhan Chen:
An investigation into subspace rapid speaker adaptation for verification. 69-72 - Manuel J. Reyes Gomez, Daniel P. W. Ellis:
Selection, parameter estimation, and discriminative training of hidden Markov models for general audio modeling. 73-76 - Chih-Kai Yang, Sou-Gee Chen:
New static and dynamic search algorithms for fast MP3 bit allocations. 77-80
Image Processing I
- Yongmin Li, Li-Qun Xu, Geoff Morrison, Charles Nightingale, Jason Morphett:
Robust panorama from MPEG video. 81-84 - Jun-Wei Hsieh:
Fast stitching algorithm for moving object detection and mosaic construction. 85-88 - Zhang John Chen, Jagath Samarabandu:
Planar region depth filling using edge detection with embedded confidence technique and Hough transform. 89-92 - S. H. Srinivasan, Mohan S. Kankanhalli:
Wide baseline spectral matching. 93-96 - Wei-Qi Yan, Mohan S. Kankanhalli:
Colorizing infrared home videos. 97-100 - Hasan F. Ates, Michael T. Orchard:
Image interpolation using wavelet-based contour estimation. 101-104 - Andy Chang, Oscar C. Au, Yick Ming Yeung:
A novel approach to fast multi-block motion estimation for H.264 video coding. 105-108 - Gulcin Caner, A. Murat Tekalp, Wendi B. Heinzelman:
Super resolution recovery for multi-camera surveillance imaging. 109-112 - Yu Hen Hu, Rajas A. Sambhare:
Constrained texture synthesis for image post processing. 113-116
Multimedia Architectures and Implementation
- Nikolaos Bellas, Malcolm Dwyer:
A programmable, high performance vector array unit used for real-time motion estimation. 117-120 - Tay-Jyi Lin, Chin-Chi Chang, Tsung-Hsun Yang, Yu-Ming Chang, Chien-Hung Lin, Chen-Chia Lee, Hung-Yueh Lin, Chein-Wei Jen:
Performance evaluation of ring-structure register file in multimedia applications. 121-124 - Tay-Jyi Lin, Tsung-Hsun Yang, Chein-Wei Jen:
Coefficient optimization for area-effective multiplier-less FIR filters. 125-128 - Satoshi Nishiguchi, Kazuhide Higashi, Yoshinari Kameda, Michihiko Minoh:
A sensor-fusion method for detecting a speaking student. 129-132 - Tsung-Han Tsai, Wen-Cheng Chen, Chun-Nan Liu:
A low power VLSI implementation for variable length decoder in MPEG-1 layer III. 133-136 - Hung-Chi Fang, Tu-Chih Wang, Yu-Wei Chang, Ya-Yun Shih, Liang-Gee Chen:
Novel word-level algorithm of embedded block coding in JPEG 2000. 137-140 - Jongmyon Kim, D. Scott Wills:
Quantized color instruction set for media-on-demand applications. 141-144 - Michelle Yan, James Shaw, Vahid Khamsi, Shih-Ping Liou:
Tracking and presenting user attention for collaborative browsing using heterogeneous devices. 145-148 - Shinsuke Kobayashi, Kentaro Mita, Yoshinori Takeuchi, Masaharu Imai:
Rapid prototyping of JPEG encoder using the ASIP development system: PEAS-III. 149-152
Text, Graphics, Face, Scene, and Song Recognition
- Ioannis Andreou, Nikitas M. Sgouros:
Sketch creation utilizing shape matching techniques. 153-156 - Michael H. Lee, Surya Nepal, Uma Srinivasan:
Edge-based semantic classification of sports video sequences. 157-160 - Gees C. Stein, Jens Rittscher, Anthony Hoogs:
Enabling video annotation using a semantic database extended with visual knowledge. 161-164 - Hidehisa Nagano, Kunio Kashino, Hiroshi Murase:
A fast search algorithm for background music signals based on the search for numerous small signal components. 165-168 - Ahmet Ekin, A. Murat Tekalp:
Generic play-break event detection for summarization and hierarchical sports video analysis. 169-172 - Amit Chakraborty, Peiya Liu, Liang H. Hsu:
Extracting anchorable information units from PDF files. 173-176 - Lijun Yin, Sergey Royt, Matt T. Yourst, Anup Basu:
Recognizing facial expressions using active textures with wrinkles. 177-180 - Francis K. H. Quek, Yingen Xiong:
Oscillatory gestures and discourse. 181-184
Networked Video II
- Haitao Zheng:
Optimizing wireless multimedia transmissions through cross layer design. 185-188 - Jacco R. Taal, Ivaylo Haratcherev, Koen Langendoen, Inald Lagendijk:
Quality of service controlled adaptive video-coding over IEEE 802.11 wireless links. 189-192 - Thomas Stockhammer:
Is fine-granular scalable video coding beneficial for wireless video applications? 193-196 - Jie Chen, S. Hsia:
Joint cross-layer design for wireless QoS video delivery. 197-200 - Trista Pei-Chun Chen, Tsuhan Chen:
Shaping for video with frame dependency. 201-204
Multimedia Security and Content Protection I
- H. Vicky Zhao, Min Wu, Z. Jane Wang, K. J. Ray Liu:
Performance of detection statistics under collusion attacks on independent multimedia fingerprints. 205-208 - Alexia Giannoula, Anastasios Tefas, Nikos Nikolaidis, Ioannis Pitas:
Improving the detection reliability of correlation-based watermarking techniques. 209-212 - Ming Sun Fu, Oscar C. Au:
A multi-bit robust watermark for halftone images. 213-216 - Nedeljko Cvejic, Djordje Tujkovic, Tapio Seppänen:
Increasing robustness of an audio watermark using turbo codes. 217-220 - Jonathan Foote, John Adcock, Andreas Girgensohn:
Time base modulation: a new approach to watermarking audio. 221-224
Virtual Reality and Imaging I
- Satya P. Mallick, Mohan M. Trivedi:
Parametric face modeling and affect synthesis. 225-228 - Inmaculada Rodríguez Santiago, Manuel Peinado, Ronan Boulic, Daniel Meziat:
Bringing the human arm reachable space to a virtual environment for its analysis. 229-232 - Cha Zhang, Tsuhan Chen:
A system for active image-based rendering. 233-236 - Yuzhong Shen, Kenneth E. Barner:
Surface denoising with directional fuzzy vector median filtering. 237-240 - Yong-In Yoon, Jang-Hwan Im, Dae-Hyun Kim, Jong-Soo Choi:
Reconstruction of linearly parameterized models using the vanishing points from a single image. 241-244
Authentication and Recognition
- Wende Zhang, Tsuhan Chen:
Personal authentication based on generalized symmetric max minimal distance in subspace. 245-248 - Thang Viet Nguyen, Jagdish Chandra Patra, Ee-Luang Ang:
Blind image extraction from nonlinear mixtures using MLP-based ICA. 249-252 - Wei Wang, Aidong Zhang, Yuqing Song:
Identification of objects from image regions. 253-256 - S. Palanivel, B. S. Venkatesh, B. Yegnanarayana:
Real time face authentication system using autoassociative neural network models. 257-260 - Dong-Wan Kang, Jun Ohya:
Postures of a human wearing a multiple-colored suit based on color information processing. 261-264
Wireless Multimedia Techniques
- Wei Wang, Michael R. Lyu:
Automatic generation of dubbing video slides for mobile wireless environment. 265-268 - Surya Nepal, Uma Srinivasan:
Adaptive video highlights for wired and wireless platforms. 269-272 - Dirk Trossen, Hemant H. Chaskar:
Enabling user-tailored MMS delivery in heterogeneous access scenarios. 273-276 - Shengjie Zhao, Zixiang Xiong, Xiaodong Wang:
Optimal resource allocation for wireless video over CDMA networks. 277-280 - Amol Bhatkar, Rajarathnam Chandramouli, Narayanan Vijaykrishnan, Mary Jane Irwin:
Computation and transmission energy modeling through profiling for MPEG4 video transmission. 281-284 - Wen Xu, Sheila S. Hemami:
Delay-optimized robust transmission of images over multiple channels. 285-288 - Wanghong Yuan, Klara Nahrstedt:
Buffering approach for energy saving in video sensors. 289-292 - Jiancong Chen, S.-H. Gary Chan, Qian Zhang, Wenwu Zhu, Jin Chen:
A distributed power adaptation algorithm for multimedia delivery over ad hoc networks. 293-296
Content-based Retrieval
- Jieh Hsiang, Wen-Jun Liu, Bee-Chung Chen, Hsieh-Chang Tu:
Multidimensional interactive fine-grained image retrieval. 297-300 - Jürgen Assfalg, Alberto Del Bimbo, Pietro Pala:
Curvature maps for 3D CBR. 301-304 - Xiangdong Zhou, Qi Zhang, Lan Lin, Ailin Deng, Gang Wu:
Image retrieval by fuzzy clustering of relevance feedback records. 305-308 - Jun Gao, George Tzanetakis, Peter Steenkiste:
Content-based retrieval of music in scalable peer-to-peer networks. 309-312 - Lei Zhang, Fang Qian, Mingjing Li, Hong-Jiang Zhang:
An efficient memorization scheme for relevance feedback in image retrieval. 313-316 - Yuxin Peng, Chong-Wah Ngo, Qing-Jie Dong, Zongming Guo, Jianguo Xiao:
Video clip retrieval by maximal matching and optimal matching in graph theory. 317-320 - Xin Huang, Shu-Ching Chen, Mei-Ling Shyu:
Incorporating real-valued multiple instance learning into relevance feedback for image retrieval. 321-324 - Ming Hong Pi, Mrinal Mandal, Anup Basu:
Image retrieval based on 2-D histogram of fractal parameters. 325-328 - Giridharan Iyengar, Harriet J. Nock, Chalapathy Neti:
Audio-visual synchrony for detection of monologues in video archives. 329-332 - Min Xu, Ling-Yu Duan, Changsheng Xu, Qi Tian:
A fusion scheme of visual and auditory modalities for event detection in sports video. 333-336
Image Processing II
- Ching-Yeh Chen, Shao-Yi Chien, Yi-Hau Chen, Yu-Wen Huang, Liang-Gee Chen:
Unsupervised object-based sprite coding system for tennis sport. 337-340 - Armando J. Pinho, António J. R. Neves:
Block-based histogram packing of color-quantized images. 341-344 - Nejat Kamaci, Yucel Altunbasak:
Performance comparison of the emerging H.264 video coding standard with the existing standards. 345-348 - Xiaodong Gu, Hong-Jiang Zhang:
Implementing dynamic GOP in video encoding. 349-352 - Yung-Gi Wu, Ming-Zhi Huang, Yu-Ling Wen:
Fractal image compression with variance and mean. 353-356 - Martin P. Boliek, Gene K. Wu:
JPEG 2000-like access using the JPM compound document file format. 357-360 - Shou-Yi Tseng:
Efficient motion estimation algorithm using run-time and distortion optimization approach. 361-364 - Liang Zhang:
Statistical model for intensity differences of corresponding points between stereo image pairs. 365-368 - Yuhua Ding, George J. Vachtsevanos, Anthony J. Yezzi Jr., Wayne Daley, Bonnie S. Heck-Ferri:
A real-time curve evolution-based image fusion algorithm for multisensory image segmentation. 369-372 - Bernd Girod, Chuo-Ling Chang, Prashant Ramanathan, Xiaoqing Zhu:
Light field compression using disparity-compensated lifting. 373-376
Speech Coding, Analysis, and Synthesis
- Christian H. Ritz, Ian S. Burnett, Jason Lukasiak:
Low bit rate wideband WI speech coding. 377-380 - Houman Zarrinkoub, Paul Mermelstein:
Joint optimization of short-term and long-term predictors in CELP speech coders. 381-384 - Om Deshmukh, Carol Y. Espy-Wilson:
A measure of aperiodicity and periodicity in speech. 385-388 - K. Sreenivasa Rao, B. Yegnanarayana:
Prosodic manipulation using instants of significant excitation. 389-392 - Arun Kumar, Ashish Verma:
Using phone and diphone based acoustic models for voice conversion: a step towards creating voice fonts. 393-396 - Xiaodong He, Wu Chou:
minimum classification error linear regression for acoustic model adaptation of continuous density HMMS. 397-400 - Björn W. Schuller, Gerhard Rigoll, Manfred K. Lang:
Hidden Markov model-based speech emotion recognition. 401-404 - Dong Wang, Lie Lu, Hong-Jiang Zhang:
Speech segmentation without speech recognition. 405-408 - Julien Pinquier, Jean-Luc Rouas, Régine André-Obrecht:
A fusion study in speech / music classification. 409-412
Multimedia Technology for Gaming
- Mohammed Chalil, K. P. Sreekumar, Manoj Sankar:
MPEG-4 based framework for game engines to handle virtual advertisements in game. 413-416 - Amaryllis Raouzaiou, Kostas Karpouzis, Stefanos D. Kollias:
Emotion representation for online gaming. 417-420 - Ghassan Al-Regib, Yucel Altunbasak:
3TP: an application-layer protocol for streaming 3-D graphics. 421-424 - Magy Seif El-Nasr, Ian Horswill:
Expressive lighting for interactive entertainment. 425-428 - Son Minh Tran, Marius Preda, Françoise J. Prêteux, Kalman Fazekas:
Exploring MPEG-4 BIFS features for creating multimedia games. 429-432
Multimedia Learning
- Raghavendra Singh, Ravi Kothari:
Relevance feedback algorithm based on learning from labeled and unlabeled data. 433-436 - Milind R. Naphade, Ching-Yung Lin, Apostol Natsev, Belle L. Tseng, John R. Smith:
A framework for moderate vocabulary semantic visual concept detection. 437-440 - Shinsuke Nakajima, Shinichi Kinoshita, Katsumi Tanaka:
Amplifying the differences between your positive samples and neighbors in image retrieval. 441-444 - Apostol Natsev, John R. Smith:
Active selection for multi-example querying by content. 445-448 - Tzvetanka I. Ianeva, Arjen P. de Vries, Hein Röhrig:
Detecting cartoons: a case study in automatic video-genre classification. 449-452
QoS
- Wuttipong Kumwilaisak, Qian Zhang, Wenwu Zhu, C.-C. Jay Kuo, Ya-Qin Zhang:
On the rate constraint of transmitting multiple priority classes with QoS. 453-456 - Bo Shen:
Meta-caching and meta-transcoding for server-side service proxy. 457-460 - Sheau-Ru Tong, Chun-Cheng Chang:
Harmonic DiffServ: provisioning scalable heterogeneous-QoS multicast in DiffServ networks. 461-464 - Rajeev Kumar:
A protocol with transcoding to support QoS over Internet for multimedia traffic. 465-468 - Nam Pham Ngoc, Gauthier Lafruit, Jean-Yves Mignolet, Serge Vernalde, Geert Deconinck, Rudy Lauwereins:
A framework for mapping scalable networked applications on run-time reconfigurable platforms. 469-472
Image/Video Rendering/Synthesis
- Pun-Mo Ho, Tien-Tsin Wong, Kwok-Hung Choy, Chi-Sing Leung:
PCA-based compression for image-based relighting. 473-476 - Amit A. Kale, Amit K. Roy-Chowdhury, Rama Chellappa:
Video based rendering of planar dynamic scenes. 477-480 - Sarah John, Mikhail A. Vorontsov:
Multiframe selective information fusion for 'looking through the woods'. 481-484 - Timothy K. Shih, Liang-Chen Lu, Ying-Hong Wang, Rong-Chi Chang:
Multi-resolution image inpainting. 485-488 - Zhanfeng Yue, Liang Zhao, Rama Chellappa:
View synthesis of articulating humans using visual hull. 489-492
Layered, Scalable & Multiple Descriptions Transmission
- Xiao Su, Rod Fatoohi:
Scalable coded image transmissions over peer-to-peer networks. 493-496 - Ji-An Zhao, Bo Li, Ishfaq Ahmad:
Traffic modeling for layered video. 497-500 - Lechang Cheng, Mabo Robert Ito:
Receiver-driven layered multicast using active networks. 501-504 - Chung-Ming Huang, Yuan-Tse Yu, Guo-Shiung Liau:
A statistical flow control mechanism for layered multimedia over the differentiated service network. 505-508 - Eric Setton, Yi J. Liang, Bernd Girod:
Adaptive multiple description video streaming over multiple channels with active probing. 509-512 - Ivan Lee, Ling Guan:
Centralized peer-to-peer streaming with layered video. 513-516 - Ali C. Begen, Yucel Altunbasak, Özlem Ergun:
Fast heuristics for multi-path selection for multiple description encoded video streaming. 517-520 - Bo Xie, Wenjun Zeng:
Source characteristics based fast bitstream switching. 521-524 - Augustin Gavrilescu, Adrian Munteanu, Peter Schelkens, Jan Cornelis:
Embedded multiple description scalar quantizers for progressive image transmission. 525-528
Image Compression
- Mylène C. Q. Farias, Sanjit K. Mitra, John M. Foley:
Perceptual contributions of blocky, blurry and noisy artifacts to overall annoyance. 529-532 - Jingdong Wang, Jianguo Lee, Changshui Zhang:
Kernel GMM and its application to image binarization. 533-536 - Rastislav Lukac, Bogdan Smolka, Konstantinos N. Plataniotis, Anastasios N. Venetsanopoulos:
Generalized adaptive vector sigma filters. 537-540 - Yao Nie, Kenneth E. Barner:
Optimized fuzzy transformation for image deblocking. 541-544 - Ee Ping Ong, Weisi Lin, Zhongkang Lu, Susu Yao, Xiaokang Yang, Lijun Jiang:
No-reference JPEG-2000 image quality metric. 545-548 - Giuseppe Messina, Alfio Castorina, Sebastiano Battiato, Angelo Bosco:
Image quality improvement by adaptive exposure correction techniques. 549-552 - Giovanni Motta, Francesco Rizzo, James A. Storer:
Partitioned vector quantization: application to lossless compression of hyperspectral images. 553-556 - Daewon Kim, Daekyu Shin:
Energy-based adaptive DCT/IDCT for video coding. 557-560 - Lorenzo Granai, Fulvio Moschetti, Pierre Vandergheynst:
Ridgelet transform applied to motion compensated images. 561-564
Coding and Noise Removal
- Phil Spencer Whitehead, David V. Anderson, Mark A. Clements:
Adaptive, acoustic noise suppression for speech enhancement. 565-568 - Ashish Jagmohan, Anshul Sehgal, Narendra Ahuja:
WYZE-PMD based multiple description video codec. 569-572 - Nualsawat Hiransakolwong, Kien A. Hua, Khanh Vu, Piotr S. Windyga:
Segmentation of ultrasound liver images: an automatic approach. 573-576 - Nuwan D. Nanayakkara, Jagath Samarabandu:
Unsupervised model based image segmentation using domain knowledge based fuzzy logic and edge enhancement. 577-580 - Zhengguo Li, Feng Pan, Keng Pang Lim, Genan Feng, Xiao Lin, Susanto Rahardja, Dajun Wu:
Adaptive frame layer rate control for H.264. 581-584 - Bogdan Smolka, Konstantinos N. Plataniotis, Rastislav Lukac, Anastasios N. Venetsanopoulos:
Similarity based impulsive noise removal in color images. 585-588 - Siva Somasundaram, Koduvayur P. Subbalakshmi:
3-D multiple description video coding for packet switched networks. 589-592 - Xu Huang, Allan C. Madoc, Andrew D. Cheetham:
Wavelet-based Bayesian estimator for Poisson noise removal from images. 593-596 - Hideaki Kimata, Masaki Kitahara, Yoshiyuki Yashima:
3D motion vector coding with block base adaptive interpolation filter on H.264. 597-600 - Ligang Lu, Vadim Sheinin:
Real-time MPEG video coding with information look-ahead. 601-604
Watermarking and Fingerprinting
- Micheal Mullarkey, Neil J. Hurley, Guenole C. M. Silvestre, Teddy Furon:
Application of side-informed embedding and polynomial detection to audio watermarking. 605-608 - Ming Sun Fu, Oscar C. Au:
A novel method to embed watermark in different halftone images: data hiding by conjugate error diffusion (DHCED). 609-612 - Hong Zhao, Min Wu, Z. Jane Wang, K. J. Ray Liu:
Nonlinear collusion attacks on independent fingerprints for multimedia. 613-616 - Z. Jane Wang, Min Wu, Hong Zhao, K. J. Ray Liu, Wade Trappe:
Resistance of orthogonal Gaussian fingerprints to collusion attacks. 617-620 - Jeffrey A. Bloom:
Security and rights management in digital cinema. 621-624 - Anandabrata Pal, Kulesh Shanmugasundaram, Nasir D. Memon:
Automated reassembly of fragmented images. 625-628 - Kaliappan Gopalan:
Audio steganography using bit modification. 629-632 - Heather Yu:
Scalable encryption for multimedia content access control. 633-636 - Andreas Kalivas, Anastasios Tefas, Ioannis Pitas:
Watermarking of 3D models using principal component analysis. 637-640
Video Processing for Multi-Camera Surveillance Systems
- Matteo Gandetto, Luca Marchesotti, S. Sciutto, D. Negroni, Carlo S. Regazzoni:
From multi-sensor surveillance towards smart interactive spaces. 641-644 - Ser-Nam Lim, Ahmed M. Elgammal, Larry S. Davis:
Image-based pan-tilt camera control in a multi-camera surveillance environment. 645-648 - Omar Javed, Zeeshan Rasheed, Orkun Alatas, Mubarak Shah:
KNIGHT™: a real time surveillance system for multiple and non-overlapping cameras. 649-652 - Fatih Porikli, Ajay Divakaran:
Multi-camera calibration, object tracking and query generation. 653-656 - Karsten Müller, Aljoscha Smolic, Michael Drose, Patrick Voigt, Thomas Wiegand:
Multi-texture modeling of 3D traffic scenes. 657-660
Wireless Multimedia I
- Jianping Hua, Zixiang Xiong:
Optimal rate allocation in progressive joint source-channel coding for image transmission over CDMA networks. 661-664 - Jie Chen:
Fast hopping OFDM and packet-awareness coder design for wireless multimedia delivery. 665-668 - Xiaofeng Xu, Mihaela van der Schaar, Santhana Krishnamachari, Sunghyun Choi, Yao Wang:
Adaptive error control for fine-granular-scalability video coding over IEEE 802.11 wireless LANs. 669-672 - Shirish Karande, Syed A. Khayam, Michael Krappel, Hayder Radha:
Analysis and modeling of errors at the 802.11b link layer. 673-676 - Yong Sun, Zixiang Xiong, Xiaodong Wang:
Iterative decoding of differentially space-time coded multiple descriptions of images. 677-680
Multimedia Hardware and Architectures
- Sebastiano Battiato, Alfio Castorina, Mirko Guarnera, Filippo Vella:
A light viewfinder pipeline for consumer devices application. 681-684 - Minseok Song, Heonshik Shin:
Minimization of buffer requirements using variable-size parity groups for fault-tolerant video servers. 685-688 - Chunjiang J. Duanmu, M. Omair Ahmad, M. N. S. Swamy:
8-bit partial sums of 16 luminance values for fast block motion estimation. 689-692 - Yu-Wen Huang, To-Wei Chen, Bing-Yu Hsieh, Tu-Chih Wang, Te-Hao Chang, Liang-Gee Chen:
Architecture design for deblocking filter in H.264/JVT/AVC. 693-696 - Xinjian Chen, Qionghai Dai:
A novel VLSI architecture for multidimensional discrete wavelet transform. 697-700
Novel Applications
- Juan Carlos Guerri, Carlos Enrique Palau, Ana Pajares, Angela Belda, Juan José Cermeño, Manuel Esteve:
A multimedia telemedicine system to assess musculoskeletal disorders. 701-704 - Shu-Ching Chen, Keqi Zhang, Min Chen:
A real-time 3D animation environment for storm surge. 705-708 - Panu Hämäläinen, Marko Hännikäinen, Timo D. Hämäläinen, Riku Soininen:
Offline architecture for real-time betting. 709-712 - Chung-Sheng Li, Charu C. Aggarwal, Murray Campbell, Yuan-Chi Chang, Gregory Glass, Vijay S. Iyengar, Mahesh Joshi, Ching-Yung Lin, Milind R. Naphade, John R. Smith, Belle L. Tseng, Min Wang, Kun-Lung Wu, Philip S. Yu:
Epi-SPIRE: a system for environmental and public health activity monitoring. 713-716 - John V. Harrison, Anna Andrusiewicz:
Enhancing digital advertising using dynamically configurable multimedia. 717-720
Speech and Audio Processing II
- Sunil Bharitkar, Philip Hilmes, Chris Kyriakakis:
Sensitivity of multichannel room equalization to listener position. 721-724 - Sascha Spors, Achim Kuntz, Rudolf Rabenstein:
Listening room compensation for wave field synthesis. 725-728 - Kenzo Obata, Kentaro Noguchi, Yoshiaki Tadokoro:
A new sound source location algorithm based on formant frequency for sound image localization. 729-732 - Arvindh Krishnaswamy, Julius O. Smith III:
Inferring control inputs to an acoustic violin from audio spectra. 733-736 - Yong Rui, Dinei A. F. Florêncio:
New direct approaches to robust sound source localization. 737-740 - Parham Aarabi, Guangji Shi, Omid S. Jahromi:
Robust speech separation using time-frequency masking. 741-744 - Zhe Feng, Yaqian Zhou, Lide Wu, Zongge Li:
Audio classification based on maximum entropy model. 745-748 - Kuntal Sengupta, Prabir Burman:
Non-parametric approach to ICA using kernel density estimation. 749-752 - Jean-Luc Rouas, Jérôme Farinas, François Pellegrino, Régine André-Obrecht:
Modeling prosody for language identification on read and spontaneous speech. 753-756
Multimedia Indexing
- Yimin Wu, Aidong Zhang:
An adaptive classification method for multimedia retrieval. 757-760 - Janghyun Yoon, Nikil Jayant:
Semantics-sensitive image retrieval: an information fusion approach. 761-764 - Anlei Dong, Bir Bhanu:
Concept learning and transplantation for dynamic image databases. 765-768 - Paisarn Muneesawang, Ling Guan:
Image retrieval with embedded sub-class information using Gaussian mixture models. 769-772 - Jeroen Vendrig, Marcel Worring, Arnold W. M. Smeulders:
Components and systems for interactive video indexing. 773-776 - Andrea Kutics, Akihiko Nakagawa, Kiyotaka Tanaka, Minoru Yamada, Yasuo Sambe, Sakuichi Ohtsuka:
Linking images and keywords for semantics-based image retrieval. 777-780 - Alejandro Jaimes, John R. Smith:
Semi-automatic, data-driven construction of multimedia ontologies. 781-784 - Keiji Yanai:
Image collector II: a system for gathering more than one thousand images from the Web for one keyword. 785-788
QoS and Broadcasts
- Corina Scheiter, Rainer Steffen, Markus Zeller, Rudi Knorr, Benno Stabernack, Kai-Immo Wels:
A system for QOS-enabled MPEG-4 video transmission over Bluetooth for mobile applications. 789-792 - Chin-Hei Chien, Wanjiun Liao:
A self-configuring RED gateway for quality of service (QoS) networks. 793-796 - Jia Zhang, Jen-Yao Chung, Zhixing Zhang:
A router model for QoS-based multimedia Web services. 797-800 - Hong Kee Sul, Hyunchul Kim, Kilnam Chon:
A hybrid pagoda broadcasting protocol: fixed-delay pagoda broadcasting protocol with partial preloading. 801-804 - Nera W. C. Liu, Jack Y. B. Lee:
Constrained consonant broadcasting - a generalized periodic broadcasting scheme for large scale video streaming. 805-808 - Yeonjoon Chung, Ahmed H. Tewfik:
An efficient video broadcasting protocol with scalable preloading scheme. 809-812 - Virgilio Rodriguez:
Resource management for scalably encoded information: the case of image transmission over wireless networks. 813-816 - Chow-Sing Lin, Tzong-Yao Chang, Jin-Ru Hsieh:
On utilizing multi-channel to provide scheduled video delivery. 817-820 - Deepak S. Turaga, Mihaela van der Schaar:
Content-adaptive filtering in the UMCTF framework. 821-824 - Zhizhong Zhe, Hong Ren Wu, Zhenghua Yu, Tim Ferguson, Damian M. Tan:
Performance evaluation of a perceptual ringing distortion metric for digital video. 825-828
Signal Processing Theory and Methods I
- Abdessamad Ben Hamza, Hamid Krim, Bilge Karaçali:
Structural risk minimization using nearest neighbor rule. 829-832 - Hamid Reza Abutalebi, Hamid Sheikhzadeh, Robert L. Brennan, George H. Freeman:
Affine projection algorithm for oversampled subband adaptive filters. 833-836 - Mohammad Bilal Malik:
State-space RLS. 837-840 - Behrouz Nowrouzian, Arthur T. G. Fuller, M. N. S. Swamy:
A necessary and sufficient condition for the BIBO stability of general-order bode-type variable-amplitude wave-digital equalizers. 841-844 - Zhong Ji, Shuren Qi:
Detection of EEG basic rhythm feature by using band relative intensity ratio(BRIR). 845-848 - Kamyar Hazaveh, Kaamran Raahemifar:
Optimized local discriminant basis algorithm. 849-852 - Palghat P. Vaidyanathan, Byung-Jun Yoon:
Discrete probability density estimation using multirate DSP models. 853-856 - Andre Tkacenko, Palghat P. Vaidyanathan:
On the least squares signal approximation model for overdecimated rational nonuniform filter banks and applications. 857-860 - J. Michael Peterson, Shubha Kadambe:
A probabilistic approach for blind source separation of underdetermined convolutive mixtures. 861-864 - Jie Liang, Lu Gan, Chengjie Tu, Trac D. Tran, Kai-Kuang Ma:
On efficient implementation of oversampled linear phase perfect reconstruction filter banks. 865-868
Volume 2
Smart Cameras
- Jörn Jachalsky, Martin Wahler, Peter Pirsch, S. Capperon, Winfried Gehrke, W. M. Kruijtzer, Antonio Núñez:
A core for ambient and mobile intelligent imaging applications. 1-4 - Wayne H. Wolf, I. Burak Özer, Tiehan Lv:
Architectures for distributed smart cameras. 5-8 - Kohsia S. Huang, Mohan M. Trivedi:
Distributed video arrays for tracking, human identification, and activity analysis. 9-12 - John W. Fisher III, Trevor Darrell:
Learning cross-modal appearance models with application to tracking. 13-16 - Jacky Mallett, M. Michael Bove Jr.:
Eye Society. 17-20
Multimedia Retrieval
- Feng Jing, Mingjing Li, Hong-Jiang Zhang, Bo Zhang:
Support vector machines for region-based image retrieval. 21-24 - Charles Parker:
Towards intelligent string matching in query-by-humming systems. 25-28 - Wing Ho Leung, Tsuhan Chen:
Hierarchical matching for retrieval of hand-drawn sketches. 29-32 - Joo-Hwee Lim, Philippe Mulhem, Qi Tian:
Event-based home photo retrieval. 33-36 - Bo Feng, Qing Li, Jun Yang, Liu Wenyin, Jian Zhai:
Efficient database facilities for content-based Flash retrieval. 37-40
Network Adaptive Techniques
- Hui Cheng, Xi Min Zhang, Yun-Qing Shi, Anthony Vetro, Huifang Sun:
Rate allocation for FGS coded video using composite R-D analysis. 41-44 - Nicola Franchi, Marco Fumagalli, Rosa Lancini:
Optimised source and channel coding for video transmission over ADSL. 45-48 - Gene Cheung, Connie Chan:
Jointly optimal reference frame & quality of service selection for H.261 video coding over lossy networks. 49-52 - Ashwatha Matthur, Padmavathi Mundur:
Dynamic load balancing across mirrored multimedia servers. 53-56 - Hongliang Li, Guizhong Liu, Yongli Li, Zhongwei Zhang:
An effective burstiness estimation model for VBR video stream. 57-60
Multimedia Software and Architectures
- Oliver Schreer, Nicole Atzpadin, Serap Askar, Peter Kauff:
Advanced 3D signal processing for Virtual Team User Environments. 61-64 - James C. Beyer, David H. C. Du:
Data storage and delivery protocols to support interactive high-resolution image browsing on a PC-cluster based image-wall. 65-68 - Wei Shu, Min-You Wu:
Scalability of closed-loop video delivery service. 69-72 - Marc Leeman, David Atienza Alonso:
Intermediate variable elimination in a global context for a 3D multimedia application. 73-76 - Andreas Girgensohn:
A fast layout algorithm for visual video summaries. 77-80
Virtual Reality and Imaging II
- Kostas Karpouzis, Amaryllis Raouzaiou, Paraskevi K. Tzouveli, Spiros Ioannou, Stefanos D. Kollias:
MPEG-4: one multimedia standard to unite all. 81-84 - Takahito Kawanishi, Masaru Tsuchida, Shigeru Takagi, Hiroshi Murase:
Small cylindrical display for anthropomorphic agents. 85-88 - Hitoshi Kanda, Jun Ohya:
Efficient, realistic method for animating dynamic behaviors of 3D botanical trees. 89-92 - Wang Hee Lee, Kuntal Sengupta, Rajeev Sharma:
Augmented reality with occlusion rendering using background-foreground segmentation and trifocal tensors. 93-96 - Lijun Jiang, Shiqian Wu, Dajun Wu, Ee Ping Ong, Susanto Rahardja:
3D shape modeling by color phase stepping light projection. 97-100 - Angus M. K. Siu, Rynson W. H. Lau:
Relief occlusion-adaptive meshes for 3D imaging. 101-104 - Roberta L. Gomes, Guillermo de Jesús Hoyos-Rivera, Jean-Pierre Courtiat:
Collaborative virtual environments: going beyond virtual reality. 105-108 - Irene Cheng:
Efficient 3D object simplification and fragmented texture scaling for online visualization. 109-112
Robustness, Error Concealment and Loss Recovery
- Wenjun Zeng:
Spatial-temporal error concealment with side information for standard video codecs. 113-116 - Hyunjoo Kim, Sooyong Kang, Heon Young Yeom:
Node selection for a fault-tolerant streaming service on a peer-to-peer network. 117-120 - Thenghong H. Yeo, Wai Choong Wong, Dong-Yan Huang:
Soft decision unequal error protection scheme for MPEG advanced audio coding. 121-124 - Fan Zhai, Randall Berry, Thrasyvoulos N. Pappas, Aggelos K. Katsaggelos:
A rate-distortion optimized error control scheme for scalable video streaming over the Internet. 125-128 - Shirish S. Karande, Hayder Radha:
A new family of channel coding schemes for real-time visual communications. 129-132 - Gaurav Agarwal, Alwin Anbu, Aniruddha Sinha:
A fast algorithm to find the region-of-interest in the compressed MPEG domain. 133-136 - Chui Sian Ong, Klara Nahrstedt, Wanghong Yuan:
Quality of protection for mobile multimedia applications. 137-140 - Timothy K. Shih, Louis H. Lin, Jen-Shiun Chiang:
Progressive image transmission by adaptive interpolation. 141-144 - Wei-Ying Kung, Chang-Su Kim, C.-C. Jay Kuo:
A spatial-domain error concealment method with edge recovery and selective directional interpolation. 145-148 - Pascal Bourdon, Bertrand Augereau, Christian Olivier, Christian Chatellier:
A PDE-based method for ringing artifact removal on grayscale and color JPEG2000 images. 149-152
Networked Multimedia
- Zhe Xiang, Qian Zhang, Wenwu Zhu, Zhensheng Zhang:
Replication strategies for peer-to-peer based multimedia distribution service. 153-156 - Amir Asif:
Multimedia learning objects for digital signal processing in communications. 157-160 - David S. Doermann, Arvind Karunanidhi, Niketu Parekh, M. A. Khan, S. Chen, Hasan Timucin Ozdemir, M. Miwa, Kuo Chu Lee:
Issues in the transmission, analysis, storage and retrieval of surveillance video. 161-164 - Tayeb Lemlouma, Nabil Layaïda:
Encoding multimedia presentations for user preferences and limited environments. 165-168 - Keng Pang Lim, Dajun Wu, Si Wu, Susanto Rahardja, Xiao Lin, Lijun Jiang, Rongshan Yu, Feng Pan, Zhengguo Li, Susu Yao, Genan Feng, Chi Chung Ko:
Video streaming on embedded devices through GPRS network. 169-172 - Qiang Ma, Katsumi Tanaka:
WebTelop: dynamic TV-content augmentation by using Web pages. 173-176 - Yasuhiko Watanabe, Kazuya Sono, Kazuya Yokomizo, Yoshihiro Okada:
Translation camera on mobile phone. 177-180 - Mihai M. Lazarescu, Svetha Venkatesh:
Using camera motion to identify types of American football plays. 181-184
Moving from Features to Semantics using Computational Media Aesthetics
- Marc Davis:
Active capture: integrating human-computer interaction and computer vision/audition to automate media capture. 185-188 - Hangzai Luo, Jianping Fan, Jing Xiao, Xingquan Zhu:
Semantic principal video shot classification via mixture Gaussian. 189-192 - Simon Moncrieff, Svetha Venkatesh, Chitra Dorai:
Horror film genre typing and scene labeling via audio analysis. 193-196 - Barbara Barry, Glorianna Davenport:
Documenting life: videography and common sense. 197-200 - John W. Mateer:
Developing effective test sets and metrics for evaluating automated media analysis systems. 201-204
Multimedia Security and Content Protection II
- Yan Sun, K. J. Ray Liu:
Multi-layer key management for secure multimedia multicast communications. 205-208 - Qibin Sun, Dajun He, Zhishou Zhang, Qi Tian:
A secure and robust approach to scalable video authentication. 209-212 - Dekun Zou, Chai Wah Wu, Guorong Xuan, Yun-Qing Shi:
A content-based image authentication system with lossless data hiding. 213-216 - Z. Jane Wang, Min Wu, Wade Trappe, K. J. Ray Liu:
Anti-collusion of group-oriented fingerprinting. 217-220 - Ankur Datta, Niels da Vitoria Lobo, John J. Leeson:
Novel feature vector for image authentication. 221-224
Source and Channel Coding
- Shan Liu, C.-C. Jay Kuo:
Joint temporal-spatial rate control for adaptive video transcoding. 225-228 - Seong Hwan Jang, Nikil Jayant:
An adaptive non-linear motion vector resampling algorithm for down-scaling video transcoding. 229-232 - Hua Yang, Kenneth Rose:
Source-channel prediction in error resilient video coding. 233-236 - Tao Chen, Zhihai He:
Single-pass distortion-smoothing encoding for low bit-rate video streaming applications. 237-240 - Cheng-Yu Pai, William E. Lynch:
MPEG-4 rate control algorithm using Laplace parameter estimation. 241-244
Image Coding and Enhancement
- Haohong Wang, Guido M. Schuster, Aggelos K. Katsaggelos:
Minmax optimal shape coding using skeleton decomposition. 245-248 - Min Shao, Kenneth E. Barner:
Soft-partition-based weighted sum filters for image enhancement. 249-252 - Yibin Yang, Lilla Böröczky:
Joint resolution enhancement and artifact reduction for MPEG-2 encoded digital video. 253-256 - Haohong Wang, Guido M. Schuster, Aggelos K. Katsaggelos:
Operational rate-distortion optimal bit allocation between shape and texture for MPEG-4 video coding. 257-260 - Zhibin Pan, Koji Kotani, Tadahiro Ohmi:
A fast full search equivalent encoding method for vector quantization by using appropriate features. 261-264
Video Analysis
- Xinguo Yu, Qi Tian, Kongwah Wan:
A novel ball detection framework for real soccer video. 265-268 - Ying Li, Yufei Ma, Hong-Jiang Zhang:
Salient region detection and tracking in video. 269-272 - Xinguo Yu, Changsheng Xu, Qi Tian, Hon Wai Leong:
A ball tracking framework for broadcast soccer video. 273-276 - Rainer Lienhart, Luhong Liang, Alexander Kuranov:
A detector tree of boosted classifiers for real-time object detection and tracking. 277-280 - Min Xu, Namunu C. Maddage, Changsheng Xu, Mohan S. Kankanhalli, Qi Tian:
Creating audio keywords for event detection in soccer video. 281-284 - Shunsuke Kamijo, Masao Sakauchi:
Segmentation of vehicles and pedestrians in traffic scene by spatio-temporal Markov random field model. 285-288 - Alan Hanjalic:
Multimodal approach to measuring excitement in video. 289-292 - Rong Jin, Alexander G. Hauptmann:
Learning to identify video shots with people based on face detection. 293-296 - Yang Ran, Qinfen Zheng:
Multi moving people detection from binocular sequences. 297-300 - Zuzana Cernekova, Constantine Kotropoulos, Ioannis Pitas:
Video shot segmentation using singular value decomposition. 301-304
Multimedia Streaming Architecture
- Hai Jin, Dafu Deng:
HHMSM: a hierarchical hybrid multicast stream merging scheme for large-scale video-on-demand systems. 305-308 - Zhen Li, Guobin Shen, Shipeng Li, Edward J. Delp:
L-TFRC: an end-to-end congestion control mechanism for video streaming over the Internet. 309-312 - Chen-Lung Chan, Shih-Yu Huang, Jia-Shung Wang:
Cooperative cache framework for video streaming applications. 313-316 - Toufik Ahmed, Ahmed Mehaoua, Vincent Lecuire:
Streaming MPEG-4 audio visual objects using TCP-friendly rate control and unequal error protection. 317-320 - Longin Jan Latecki, Kishore Kulkarni, Jaiwant Mulik:
Better audio performance when video stream is monitored by TCP congestion control. 321-324 - Xuxian Jiang, Yu Dong, Dongyan Xu, Bharat K. Bhargava:
GnuStream: a P2P media streaming system prototype. 325-328 - Jun Guo, Peter G. Taylor, Moshe Zukerman, Sammy Chan, Kit-Sang Tang, Eric W. M. Wong:
On the efficient use of video-on-demand storage facility. 329-332 - Michael Harville, Michele Covell, Susie J. Wee:
An architecture for componentized, network-based media services. 333-336
Image Classification and Detection
- Sungju Youm, Woosaeng Kim:
Dynamic threshold method for scene change detection. 337-340 - Woosaeng Kim, Ji Yoon Kim:
Image classification using spatial relationship matrix based on color spatio-histogram. 341-344 - Xavier Gibert, Huiping Li, David S. Doermann:
Sports video classification using HMMS. 345-348 - Shaohua Kevin Zhou, Rama Chellappa, Baback Moghaddam:
Adaptive visual tracking and recognition using particle filters. 349-352 - Hwajeong Lee, Daehwan Kim, Daijin Kim, Sung Yang Bang:
Real-time automatic vehicle management system using vehicle tracking and car plate number identification. 353-356 - Junqiang Lan, Xinhua Zhuang:
Embedded SLCCA for wavelet image coding. 357-360 - Jian Zhou, Huai-Rong Shao, Chia Shen, Ming-Ting Sun:
FGS enhancement layer truncation with minimized intra-frame quality variation. 361-364 - Aya Aner-Wolf:
Determining a scene's atmosphere by film grammar rules. 365-368 - Mukesh A. Zaveri, Uday B. Desai, S. N. Merchant:
Tracking multiple maneuvering point targets using multiple filter bank in infrared image sequence. 369-372
Indexing, Segmentation, and Retrieval
- Paisarn Muneesawang, Ling Guan:
Automatic relevance feedback for video retrieval. 373-376 - Miki Haseyama, Isao Kondo:
2-D functional AR model for image identification. 377-380 - Chi-Man Pun:
Invariant content-based image retrieval by wavelet energy signatures. 381-384 - Jiqiang Song, Min Cai, Michael R. Lyu:
A robust statistic method for classifying color polarity of video text. 385-388 - Akisato Kimura, Kunio Kashino, Takayuki Kurozumi, Hiroshi Murase:
Dynamic-segmentation-based feature dimension reduction for quick audio/video searching. 389-392 - Miki Haseyama, Atsushi Matsumura:
A trainable retrieval system for cartoon character images. 393-396 - Sangoh Jeong, Chee Sun Won, Robert M. Gray:
Histogram-based image retrieval using Gauss mixture vector quantization. 397-400 - Qixiang Ye, Wen Gao, Wei Zeng:
Color image segmentation using density-based clustering. 401-404
Video Segmentation for Semantic Annotation and Transcoding
- Ba Tu Truong, Svetha Venkatesh, Chitra Dorai:
Identifying film takes for cinematic analysis. 405-408 - Nathalie Peyrard, Patrick Bouthemy:
Motion-based selection of relevant video segments for video summarisation. 409-412 - Winston H. Hsu, Shih-Fu Chang:
A statistical framework for fusing mid-level perceptual features in news story segmentation. 413-416 - Anthony Vetro, Tetsuji Haga, Kazuhiko Sumi, Huifang Sun:
Object-based coding for long-term archive of surveillance video. 417-420 - Marco Bertini, Rita Cucchiara, Alberto Del Bimbo, Andrea Prati:
Object and event detection for semantic annotation and transcoding. 421-424
Wireless Multimedia II
- Syed A. Khayam, Shirish Karande, Michael Krappel, Hayder Radha:
Cross-layer protocol design for real-time multimedia applications over 802.11 b networks. 425-428 - Fan Yang, Qian Zhang, Wenwu Zhu, Ya-Qin Zhang:
An end-to-end TCP-friendly streaming protocol for multimedia over wireless Internet. 429-432 - Zhijun Lei, Nicolas D. Georganas:
Rate adaptation transcoding for video streaming over wireless channels. 433-436 - Yong Pei, James W. Modestino:
Interactive video coding and transmission over wired-to-wireless IP networks using an edge proxy. 437-440 - Allen Miu, John G. Apostolopoulos, Wai-tian Tan, Mitchell D. Trott:
Low-latency wireless video over 802.11 networks using path diversity. 441-444
Multimedia Semantics
- John R. Smith, Milind R. Naphade, Apostol Natsev:
Multimedia semantic indexing using model vectors. 445-448 - Dinh Quoc Phung, Svetha Venkatesh, Chitra Dorai:
On the extraction of thematic and dramatic functions of content in educational videos. 449-452 - Brett Adams, Chitra Dorai, Svetha Venkatesh, Hung Hai Bui:
Indexing narrative structure and semantics in motion pictures with a probabilistic framework. 453-456 - Jiebo Luo, Amit Singhal, Weiyu Zhu:
Natural object detection in outdoor scenes based on probabilistic spatial context models. 457-460 - Shinichi Takagi, Shinobu Hattori, Kazumasa Yokoyama, Akihisa Kodate, Hideyoshi Tominaga:
Sports video categorizing method using camera motion parameters. 461-464
Face, Body, and Audio-visual Analysis
- Yong Ma, Xiaoqing Ding:
Robust real-time face detection based on cost-sensitive AdaBoost method. 465-468 - Jonathan H. Connell, Norman Haas, Etienne Marcheret, Chalapathy Neti, Gerasimos Potamianos, Senem Velipasalar:
A real-time prototype for small-vocabulary audio-visual ASR. 469-472 - Mingkun Li, Dongge Li, Nevenka Dimitrova, Ishwar K. Sethi:
Audio-visual talking face detection. 473-476 - Chung-Lin Huang, Chia-Ying Chung:
A real-time model-based human motion analysis system. 477-480 - Petar S. Aleksic, Aggelos K. Katsaggelos:
Product HMMs for audio-visual continuous speech recognition using facial animation parameters. 481-484
Multimedia Security and Content Protection III
- Peter Hon-Wah Wong, Yick Ming Yeung, Oscar C. Au:
Capacity for JPEG2000-to-JPEG2000 images watermarking. 485-488 - Chun-Shien Lu:
Dual security-based image steganography. 489-492 - Yongdong Wu, Feng Bao, Changsheng Xu:
The security flaws in some authentication watermarking schemes. 493-496 - Huiping Guo, Nicolas D. Georganas:
Digital image watermarking for joint ownership verification without a trusted dealer. 497-500 - Feilong Liu, Yangsheng Wang:
An improved block dependent fragile image watermark. 501-504 - Gwenaël J. Doërr, Jean-Luc Dugelay:
New intra-video collusion attack using mosaicing. 505-508 - Shaohui Liu, Hongxun Yao, Wen Gao:
Neural network based steganalysis in still images. 509-512 - Serhat Erküçük, Sridhar Krishnan, Mehmet Zeytinoglu:
Robust audio watermarking using a chirp based technique. 513-516
Multimedia Distribution
- Min-You Wu, Wei Shu:
Video distribution with edge stations and Wi-Fi delivery networks. 521-524 - Si Woong Jang, Yong Woon Park:
A dynamic multicasting policy based on proxy caching. 525-528 - Bahjat Qazzaz, Javier Moreno, Xiaoyuan Yang, Porfidio Hernández, Remo Suppi, Emilio Luque:
Admission control policies for video on demand brokers. 529-532 - Qiang Liu, Jenq-Neng Hwang:
A new congestion control algorithm for layered multicast in heterogeneous multimedia dissemination. 533-536 - Hugh Melvin, Liam Murphy:
An integrated NTP-RTCP solution to audio skew detection and compensation for VoIP applications. 537-540 - Hong Man, Yang Li:
Multi-stream video transport over DiffServ wireless LANS. 541-544 - Syed Irtiza Ali, Hayder Radha:
Hierarchical handoff schemes over wireless LAN/WAN networks for multimedia applications. 545-548
Image Compression and Modeling
- Takahiro Nakayama, Masahiro Konda, Koji Takeuchi, Koji Kotani, Tadahiro Ohmi:
Adaptive resolution vector quantization technique and basic codebook design method for compound image compression. 549-552 - Xingsong Hou, Guizhong Liu:
A wavelet packet image coding algorithm based on quadtree classification and UTCQ. 553-556 - Xiaopeng Fan, Yan Lu, Wen Gao:
A novel coefficient scanning scheme for directional spatial prediction-based image compression. 557-560 - Deepak S. Turaga, Mihaela van der Schaar:
Reduced complexity spatio-temporal scalable motion compensated wavelet video encoding. 561-564 - Yuxin Liu, Zhen Li, Paul Salama, Edward J. Delp:
A discussion of leaky prediction based scalable coding. 565-568 - Jean Cardinal:
Compression of side information. 569-572 - Feng Pan, Zhengguo Li, Keng Pang Lim, Dajun Wu, Rongshan Yu, Genan Feng:
An adaptive rate control algorithm for video coding over personal digital assistants (PDA). 573-576 - Geovanni Martinez:
Maximum-likelihood motion estimation of a human face. 577-580 - Mihaela van der Schaar, Deepak S. Turaga:
Unconstrained motion compensated temporal filtering (UMCTF) framework for wavelet video coding. 581-584 - Shan Suthaharan:
A perceptually significant block-edge impairment metric for digital video coding. 585-588
Signal Processing Theory and Methods II
- Zheng Fang, Yingbo Hua:
Maximum likelihood method for blind identification of multiple autoregressive channels. 589-592 - Khim Sia Tan, Woon-Seng Gan, Jun Yang, Meng Hwa Er:
Constant beamwidth beamformer for difference frequency in parametric array. 593-596 - Omid S. Jahromi, Parham Aarabi:
Time delay estimation and signal reconstruction using multi-rate measurements. 597-600 - Yunnan Wu, Sun-Yuan Kung:
Detection for MIMO systems with imprecise channel knowledge. 601-604 - Xinying Zhang, Sun-Yuan Kung:
Capacity analysis for parallel and sequential MIMO equalizers. 605-608 - Timo Roman, Mihai Enescu, Visa Koivunen:
Time-domain method for tracking dispersive channels in MIMO OFDM systems. 609-612 - Frank Papenfuß, Yuri Artyukh, Eugene S. Boole, Dirk Timmermann:
Optimal sampling functions in nonuniform sampling driver designs to overcome the Nyquist limit. 613-616 - Pamornpol Jinachitra:
Constrained EM estimates for harmonic source separation. 617-620 - Khaled Amleh, Hongbin Li:
Blind code timing and carrier offset estimation for DS-CDMA systems. 621-624 - M. Mauricio Lara, Aldo G. Orozco-Lugo, Desmond C. McLernon, Hugo J. Muro-Lemus:
Blind recovery of multiple packets in ad hoc mobile networks using polynomial phase modulating sequences. 625-628
Multimedia Authoring and Presentation
- Jun Kong, Meikang Qiu, Kang Zhang:
Authoring multimedia documents through grammatical specifications. 629-632 - Zhaohui Sun, Jon Riek, Alexander C. Loui:
High resolution multimedia slide show composition for video CD and DVD rendering. 633-636 - Tero Jokela:
Authoring tools for mobile multimedia content. 637-640 - Heikki Keränen, Tapani Rantakokko, Jani Mäntyjärvi:
Sharing and presenting multimedia and context information within online communities using mobile terminals. 641-644 - Ilpo Koskinen:
User-generated content in mobile multimedia: empirical evidence from user studies. 645-648
Multimedia Streaming
- Yang Guo, Kyoungwon Suh, Jim Kurose, Don Towsley:
A peer-to-peer on-demand streaming service and its performance evaluation. 649-652 - Joohee Kim, Russell M. Mersereau, Yucel Altunbasak:
Network-adaptive video streaming using multiple description coding and path diversity. 653-656 - Giancarlo Fortino, Wilma Russo, Eugenio Zimeo:
Enhancing cooperative playback systems with efficient encrypted multimedia streaming. 657-660 - Matthias Ohlenroth, Hermann Hellwagner:
A protocol for adaptation-aware multimedia streaming. 661-664 - Yufeng Shan, Shivkumar Kalyanaraman:
Hybrid video downloading/streaming over peer-to-peer networks. 665-668
Capturing and Indexing Multimedia Events and Content
- Werner Geyer, Heather Richter, Gregory D. Abowd:
Making multimedia meeting records more meaningful. 669-672 - Jiqiang Song, Michael R. Lyu, Jenq-Neng Hwang, Min Cai:
PVCAIS: a personal videoconference archive indexing system. 673-676 - Yoshinari Kameda, Satoshi Nishiguchi, Michihiko Minoh:
CARMUL: concurrent automatic recording for multimedia lecture. 677-680 - Nikolai Joukov, Tzi-cker Chiueh:
Lectern II: a multimedia lecture capturing and editing system. 681-684 - Avare Stewart, Patrick Wolf, Matthias L. Hemmje:
Media and metadata management for capture and access systems in electronic lecturing environments. 685-688
Image/Video Indexing and Retrieval
- Yanjun Qi, Alexander G. Hauptmann, Ting Liu:
Supervised classification for video shot segmentation. 689-692 - Zhu Li, Aggelos K. Katsaggelos, Bhavan Gandhi:
Temporal rate-distortion based optimal video summary generation. 693-696 - Ankur Mani:
Video segmentation using stabilized inverse diffusion. 697-700 - Daekyu Shin, Daewon Kim, Hyunsool Kim, Sanghui Park:
An image retrieval technique using rotationally invariant Gabor features and a localization method. 701-704 - Alexander Haubold, John R. Kender:
Analysis and interface for instructional video. 705-708
Speech and Audio Processing III
- Manu Mathew, Vasudha Bhat, Shine M. Thomas, Changhoon Yim:
Modified MP3 encoder using complex modified cosine transform. 709-712 - Björn W. Schuller, Gerhard Rigoll, Manfred K. Lang:
HMM-based music retrieval using stereophonic feature information and framelength adaptation. 713-716 - Aaron S. Master, Yi-Wen Liu:
Robust chirp parameter estimation for Hann windowed signals. 717-720 - Ting-Yao Wu, Lie Lu, Ke Chen, Hong-Jiang Zhang:
UBM-based incremental speaker adaptation. 721-724 - Cheng-Yuan Lin, Jyh-Shing Roger Jang:
New refinement schemes for voice conversion. 725-728 - Dong-Yan Huang, Ruihua Ma:
Integer fast modified cosine transform. 729-732 - Hadi Harb, Liming Chen:
Gender identification using a general audio classifier. 733-736 - Jouni Paulus, Anssi Klapuri:
Conventional and periodic N-grams in the transcription of drum sequences. 737-740 - Steven J. Rennie, Parham Aarabi, Trausti T. Kristjansson, Brendan J. Frey, Kannan Achan:
Robust variational speech separation using fewer microphones than speakers. 741-744
Video Processing for Multimedia Interaction
- Alexandre R. J. François, Eun-Young Elaine Kang:
A handheld mirror simulation. 745-748 - Jamey Graham, Jonathan J. Hull:
A paper-based interface for video browsing and retrieval. 749-752 - Frank M. Shipman III, Andreas Girgensohn, Lynn Wilcox:
Creating navigable multi-level video summaries. 753-756 - Lalitha Agnihotri, Nevenka Dimitrova, John R. Kender, John Zimmerman:
Study on requirement specifications for personalized multimedia summarization. 757-760 - Chun-Chuan Yang, Chih-Wen Tien, Yung-Chi Wang:
Supporting VCR-like operations in SMIL2.0 players. 761-764 - Erkut Erdem, Aykut Erdem, Volkan Atalay, A. Enis Çetin:
Computer vision based unistroke keyboard system and mouse for the handicapped. 765-768 - Ishwar Ramani, Rajiv P. Bharadwaja, P. Venkat Rangan:
Location tracking for media appliances in wireless home networks. 769-772 - Lujun Yuan, Wen Gao, Yan Lu:
Latest arrival time leaky bucket for HRD constrained video coding. 773-776
Motion Estimation
- Charay Lerdsudwichai, Mohamed Abdel-Mottaleb:
Algorithm for multiple faces tracking. 777-780 - Patrick Lanvin, Jean-Charles Noyer, Mohammed Benjelloun:
Non-linear estimation of image motion and tracking. 781-784 - Mireya S. Garcia, Henri Nicolas:
Video object motion applications focusing on non-planar rotation. 785-788 - Yu-Kuang Tu, Jar-Ferr Yang, Yi-Nung Shen, Ming-Ting Sun:
Fast variable-size block motion estimation using merging procedure with an adaptive threshold. 789-792 - Hongbin Wang, Hua Lin:
A spectral clustering approach to motion segmentation based on motion trajectory. 793-796 - Korada Ramkishor, T. S. Raghu, K. Suman, Pallapothu S. S. B. K. Gupta:
Spatial correlation based fast field motion vector estimation algorithm for interlaced video encoding. 797-800 - Ye Lu, Cheng Lu, Ze-Nian Li:
A modified space frequency decomposition algorithm for visual motion. 801-804 - Sumeer Goel, Mohsen Shaaban, Tarek Darwish, Hanan A. Mahmoud, Magdy A. Bayoumi:
Memory accesses reduction for MIME algorithm. 805-808 - Yu-Wen Huang, Bing-Yu Hsieh, Tu-Chih Wang, Shao-Yi Chien, Shyh-Yih Ma, Chun-Fu Shen, Liang-Gee Chen:
Analysis and reduction of reference frames for motion estimation in MPEG-4 AVC/JVT/H.264. 809-812 - Shunan Lin, Anthony Vetro, Yao Wang:
Rate-distortion analysis of the multiple description motion compensation video coding scheme. 813-816
Design and Implementation of Signal Processing Systems
- Adel Baganne, Imed Bennour, Mehrez Elmarzougui, Eric Martin:
A simulation based approach for incorporating virtual components IP cores into multimedia systems design. 817-820 - Atsushi Hatabu, Takashi Miyazaki, Ichiro Kuroda:
Optimization of decision-timing for early termination of SSDA-based block matching. 821-824 - Xiaojuan Hu, Linda DeBrunner, Victor E. DeBrunner:
Design of space-efficient, wide- and narrow transition-band, FIR filters. 825-828 - Duy Cuong Nguyen, Parham Aarabi, Ali Sheikholeslami:
Real-time sound localization using field-programmable gate arrays. 829-832 - Sang Yoon Park, Nam Ik Cho:
Fixed point error analysis of CORDIC processor based on the variance propagation. 833-836 - Justin J. Song, Jian Li, Yen-Kuang Chen:
Quality-delay-and-computation trade-off analysis of acoustic echo cancellation on general-purpose CPU. 837-840 - Etienne Cornu, Hamid Sheikhzadeh, Robert L. Brennan, Hamid Reza Abutalebi, Edmund C. Y. Tam, Peter Iles, Kar Wai Wong:
ETSI AMR-2 VAD: evaluation and ultra low-resource implementation. 841-844 - Daisuke Takahashi:
A radix-16 FFT algorithm suitable for multiply-add instruction based on Goedecker method. 845-848 - Sung-Won Lee, In-Cheol Park:
Low-power hybrid structure of digital matched filters for direct sequence spread spectrum systems. 849-852
Volume 3
Theoretical Insights and Improvements for Multimodal Biometrics
- Conrad Sanderson, Samy Bengio, Hervé Bourlard, Johnny Mariéthoz, Ronan Collobert, Mohamed Faouzi BenZeghiba, Fabien Cardinaux, Sébastien Marcel:
Speech & face based biometric authentication at IDIAP. 1-4 - Julian Fiérrez-Aguilar, Javier Ortega-Garcia, Joaquin Gonzalez-Rodriguez:
Fusion strategies in multimodal biometric verification. 5-8 - Upendra V. Chaudhari, Ganesh N. Ramaswamy, Gerasimos Potamianos, Chalapathy Neti:
Information fusion and decision cascading for audio-visual speaker recognition based on time-varying stream reliability prediction. 9-12 - Xiaoguang Lu, Yunhong Wang, Anil K. Jain:
Combining classifiers for face recognition. 13-16 - Arslan Brömme:
A classification of biometric signatures. 17-20
Summarization
- Michael G. Christel, Chang Huang:
Enhanced access to digital video through visually rich interfaces. 21-24 - Berna Erol, Dar-Shyang Lee, Jonathan J. Hull:
Multimodal summarization of meeting recordings. 25-28 - Lexing Xie, Shih-Fu Chang, Ajay Divakaran, Huifang Sun:
Unsupervised discovery of multilevel statistical video structures using hierarchical hidden Markov models. 29-32 - Stefano Berretti, Alberto Del Bimbo, Pietro Pala:
Merging results of distributed image libraries. 33-36 - Rui Cai, Lie Lu, Hong-Jiang Zhang, Lian-Hong Cai:
Highlight sound effects detection in audio stream. 37-40
Multistream Audio and Video Processing for Telepresence
- Douglas L. Jones:
Four-dimensional sound source recovery from arbitrary acoustic arrays. 41-44 - Qiong Liu, Don Kimber, Jonathan Foote, Chunyuan Liao:
Multichannel video/audio acquisition for immersive conferencing. 45-48 - Wolfgang Herbordt, Herbert Buchner, Walter Kellermann, Rudolf Rabenstein, Sascha Spors, Heinz Teutsch:
Full-duplex multichannel communication: real-time implementations in a general framework. 49-52 - Parham Aarabi, Bob Mungamuru:
Scene reconstruction using distributed microphone arrays. 53-56 - Ankur Mohan, Ramani Duraiswami, Dmitry N. Zotkin, Daniel DeMenthon, Larry S. Davis:
Using computer vision to generate customized spatial audio. 57-60
Video/Image tracking
- Takashi Yamamoto, Rama Chellappa:
Shape and motion driven particle filtering for human body tracking. 61-64 - Karthik Hariharakrishnan, Dan Schonfeld, Philippe Raffy, Fathy Yassa:
Object tracking using adaptive block matching. 65-68 - Gabriel Tsechpenakis, Kostas Rapantzikos, Nicolas Tsapatsoulis, Stefanos D. Kollias:
Object tracking in clutter and partial occlusion through rule-driven utilization of Snakes. 69-72 - Ofer Miller, Ety Navon, Amir Averbuch:
Tracking of moving objects based on graph edges similarity. 73-76 - Hao Jiang, Mark S. Drew:
Shadow-resistant tracking in video. 77-80
Multimedia Security and Content Protection IV
- Adnan Abdul-Aziz Gutub, Mohammad K. Ibrahim:
High performance elliptic curve GF(2k) cryptoprocessor architecture for multimedia. 81-84 - Wei-Qi Yan, Mohan S. Kankanhalli:
Scrambling of engineering drawings. 85-88 - Mitsuru Kondo, Daigo Muramatsu, Masahiro Sasaki, Takashi Matsumoto:
Nonlinear separation of signature trajectories for on-line personal authentication. 89-92 - José Gabriel Rodríguez Carneiro Gomes, Mylène Christine Queiroz de Farias, Sanjit K. Mitra, Marco Carli:
An accurate billing mechanism for multimedia communications. 93-96 - Dipti Prasad Mukherjee, Subhamoy Maitra:
Robust buyer authentication scheme for multimedia object. 97-100 - Haiping Lu, Alex C. Kot, Susanto Rahardja:
Binary image watermarking through biased binarization. 101-104 - Suk Hwan Lee, Tae-Su Kim, Byung-Ju Kim, Seong Geun Kwon, Ki-Ryong Kwon, Kuhn-Il Lee:
3D polygonal meshes watermarking using normal vector distributions. 105-108 - Nut Taesombut, Vineet Kumar, Rishi Dubey, P. Venkat Rangan:
Secure registration protocol for media appliances in wireless home networks. 109-112
Human Movement and Face Analysis
- Naresh P. Cuntoor, Amit A. Kale, Rama Chellappa:
Combining multiple evidences for gait recognition. 113-116 - Richard D. Green, Ling Guan:
Tracking human movement patterns using particle filtering. 117-120 - Jian Li, Shaohua Kevin Zhou, Chandra Shekhar:
A comparison of subspace analysis for face recognition. 121-124 - Jianyu Wang, Wen Gao, Shiguang Shan, XiaoPeng Hu:
Facial feature tracking combining model-based and model-free method. 125-128 - Shaohua Kevin Zhou, Rama Chellappa:
Simultaneous tracking and recognition of human faces from video. 129-132 - Gang Pan, Zhaohui Wu, Yunhe Pan:
Automatic 3D face verification from range data. 133-136 - Heng Liu, Shengye Yan, Xilin Chen, Wen Gao:
Rotated face detection in color images using radial template (RT). 137-140 - Xiujuan Chai, Shiguang Shan, Wen Gao, Bo Cao:
Novel example-based shape learning for fast face alignment. 141-144 - Do-Hyung Kim, Jaeyeon Lee, Jung Soh, YunKoo Chung:
Real-time face verification using multiple feature combination and a support vector machine supervisor. 145-148 - Wen Gao, Shiguang Shan, Xiujuan Chai, Xiaowei Fu:
Virtual face image generation for illumination and pose insensitive face recognition. 149-152
Image and Video Coding and Analysis
- Chengjie Tu, Trac D. Tran, Jie Liang:
Error resilient pre-/post-filtering for DCT-based block coding systems. 153-156 - Aysegul Cuhadar, Sinan Tasdoken:
Multiple arbitrary shape ROI coding with zerotree based wavelet coders. 157-160 - Marie Babel, Olivier Déforges:
Lossless and lossy minimal redundancy pyramidal decomposition for scalable image compression technique. 161-164 - Jari Korhonen, Ye Wang:
Schemes for error resilient streaming of perceptually coded audio. 165-168 - Stefano Belfiore, Marco Grangetto, Enrico Magli, Gabriella Olmo:
Spatio-temporal video error concealment with perceptually optimized mode selection. 169-172 - Son Lam Phung, Douglas Chai, Abdesselam Bouzerdoum:
Adaptive skin segmentation in color images. 173-176 - Takuma Ishida, Shogo Muramatsu, Hisakazu Kikuchi, Tetsuro Kuge:
Invertible deinterlacing with variable coefficients and its lifting implementation. 177-180 - Namrata Vaswani, Amit K. Roy-Chowdhury, Rama Chellappa:
Statistical shape theory for activity modeling. 181-184 - John N. Carter, Pelopidas Lappas, Robert I. Damper:
Evidence-based object tracking via global energy maximization. 185-188 - Manoranjan Paul, M. Manzur Murshed, Laurence Dooley:
A new real-time pattern selection algorithm for very low bit-rate video coding focusing on moving regions. 189-192
Speech and Audio Processing IV
- Ye Wang, Jian Tang, Ali Ahmaniemi, Markus Vaalgamaa:
Parametric vector quantization for coding percussive sounds in music. 193-196 - Mukund Devarajan, Fansheng Meng, Penny Hix, Stephen A. Zahorian:
HMM-neural network monophone models for computer based articulation training for the hearing impaired. 197-200 - Suryakanth V. Gangashetty, C. Chandra Sekhar, B. Yegnanarayana:
Constraint satisfaction model for enhancement of evidence in recognition of consonant-vowel utterances. 201-204 - Daniel Garcia-Romero, Julian Fiérrez-Aguilar, Joaquin Gonzalez-Rodriguez, Javier Ortega-Garcia:
Support vector machine fusion of idiolectal and acoustic speaker information in Spanish conversational speech. 205-208 - Takanobu Nishiura, Masato Nakayama, Satoshi Nakamura:
An evaluation of adaptive beamformer based on average speech spectrum for noisy speech recognition. 209-212 - Jianhua Tao, Xing Ni:
Auditive learning based Chinese F0 prediction. 213-216 - Justinian P. Rosca, Radu V. Balan, Christophe Beaugeant:
Multi-channel psychoacoustically motivated speech enhancement. 217-220 - Jin Li:
A progressive to lossless embedded audio coder (PLEAC) with reversible modulated lapped transform. 221-224
Signal Processing and Testing in Multimodal Biometrics
- Frank Zoebisch, Claus Vielhauer:
A test tool to support brute-force online and offline signature forgery tests on mobile devices. 225-228 - Marios Savvides, Krithika Venkataramani, B. V. K. Vijaya Kumar:
Incremental updating of advanced correlation filters for biometric authentication systems. 229-232 - Ziyou Xiong, Yunqiang Chen, Roy Wang, Thomas S. Huang:
A real time automatic access control system based on face and eye corners detection, face recognition and speaker identification. 233-236 - Umut Uludag, Anil K. Jain:
Multimedia content protection via biometrics-based encryption. 237-240 - Enrico Grosso, Massimo Tistarelli:
On testing methods for biometric authentication. 241-244
Multimedia Coding and Transport
- Narasinha Kamat, Ju Wang, Jonathan C. L. Liu:
A delay-efficient rerouting scheme for VoIP traffic. 245-248 - Xiaofei Liao, Hai Jin:
A new cluster-based distributed video recorder server. 249-252 - Zhihua Chen, Bobby Bodenheimer, J. Fritz Barnes:
Extending progressive meshes for use over unreliable networks. 253-256 - Christian Bachmeir, Peter Tabery, Serdar Uzumcu, Eckehard G. Steinbach:
A scalable virtual programmable real-time testbed for rapid multimedia service creation and evaluation. 257-260 - Bulent Cavusoglu, Dan Schonfeld, Rashid Ansari:
Real-time adaptive forward error correction for MPEG-2 video communications over RTP networks. 261-264
Multimedia Standards
- Chun-Chuan Yang, Chih-Wen Tien, Yung-Chi Wang:
Modeling of the non-deterministic synchronization behaviors in SMIL2.0 documents. 265-268 - Zaher Aghbari, Akifumi Makinouchi:
Extending MPEG-7 description scheme of moving regions by the semantic visual-spatio-temporal relationships. 269-272 - Jason Lukasiak, David Stirling, Nick Harders, Shane Perrow:
Performance of MPEG-7 low level audio descriptors with compressed data. 273-276 - Yick Ming Yeung, Oscar C. Au, Andy Chang:
Efficient rate control technique for JPEG2000 image coding using priority scanning. 277-280 - Jae-Gon Kim, Yong Wang, Shih-Fu Chang:
Content-adaptive utility-based video adaptation. 281-284
Face Analysis and Modeling
- Haitao Wang, Hong Wei, Yangsheng Wang:
Face representation under different illumination conditions. 285-288 - A-Nasser Ansari, Mohamed Abdel-Mottaleb:
3D face modeling using two orthogonal views and a generic face model. 289-292 - Chong Luo, Tat-Seng Chua, Teck Khim Ng:
Face tracking in video with hybrid of Lucas-Kanade and condensation algorithm. 293-296 - Xin Fan, Qi Zhang, Dequn Liang, Ling Zhao:
Face image restoration based on statistical prior and image blur measure. 297-300 - Yao-Hong Tsai, Yea-Shuan Huang:
Fast hierarchical face detection. 301-304
Segmentation, Summarization, and Structuring
- Ichiro Ide, Hiroshi Mo, Norio Katayama, Shin'ichi Satoh:
Topic-based inter-video structuring of a large-scale news video corpus. 305-308 - Ewa Kijak, Guillaume Gravier, Patrick Gros, Lionel Oisel, Frédéric Bimbot:
HMM based structuring of tennis videos using visual and audio cues. 309-312 - Lionel Brunel, Pierre Mathieu:
Fast method of segmentation and indexing MPEG1-2 flow. 313-316 - Yue Zhang, Mario A. Nascimento, Osmar R. Zaïane:
Building image mosaics: an application of content-based image retrieval. 317-320 - Wenli Zhang, Xiaomeng Wu, Shunsuke Kamijo, Masao Sakauchi:
A proposal for a video content generation support system and its application. 321-324 - Yan Liu, John R. Kender:
Fast scene segmentation using multi-level feature selection. 325-328 - Jek Charlson So Yu, Mohan S. Kankanhalli, Philippe Mulhem:
Semantic video summarization in compressed domain MPEG video. 329-332 - Xingquan Zhu, Xindong Wu:
Sequential association mining for video summarization. 333-336 - Eliza Yingzi Du, Chein-I Chang, Paul D. Thouin:
An unsupervised approach to color video thresholding. 337-340 - Darren E. Butler, Sridha Sridharan, V. Michael Bove Jr.:
Real-time adaptive background segmentation. 341-344
Rate Control and Packet Classification for Transmission
- Enrico Masala, Juan Carlos De Martin:
Analysis-by-synthesis distortion computation for rate-distortion optimized multimedia streaming. 345-348 - Yuh-Ching Wang, Jin-Jang Leou:
A rate control scheme for H.26L video transmission. 349-352 - Mei-Ling Shyu, Shu-Ching Chen, Hongli Luo:
Ensuring fairness in multimedia multicast streaming with optimal rate allocation and client buffer utilization. 353-356 - S. R. Subramanya, Jagannathan Sarangapani, Mingsheng Peng:
A scheme for fair, rate-based end-to-end congestion control of multimedia traffic in packet switched networks. 357-360 - Chi-Wah Wong, Oscar C. Au, Bojun Meng, Hong-Kwai Lam:
Perceptual rate control for low-delay video communications. 361-364 - Mei-Ling Shyu, Shu-Ching Chen, Hongli Luo:
Per-class queue management and adaptive packet drop mechanism for multimedia networking. 365-368 - Davide Quaglia, Juan Carlos De Martin:
Adaptive packet classification for constant perceptual quality of service delivery of video streams over time-varying networks. 369-372 - Qiang Liu, Jenq-Neng Hwang:
End-to-end available bandwidth estimation and time measurement adjustment for multimedia QOS. 373-376 - Lifeng Zhao, C.-C. Jay Kuo:
Buffer-constrained R-D optimized rate control for video coding. 377-380
Audio Signal Processing
- Dmitry N. Zotkin, Shihab A. Shamma, Powen Ru, Ramani Duraiswami, Larry S. Davis:
Pitch and timbre manipulations using cortical representation of sound. 381-384 - Hsuan-Huei Shih, Shrikanth S. Narayanan, C.-C. Jay Kuo:
Multidimensional humming transcription using a statistical approach for query by humming systems. 385-388 - Arvindh Krishnaswamy:
Application of pitch tracking to South Indian classical music. 389-392 - Mohammed Raad, Alfred Mertins, Ian S. Burnett:
Scalable to lossless audio compression based on perceptual set partitioning in hierarchical trees (PSPIHT). 393-396 - Ziyou Xiong, Regunathan Radhakrishnan, Ajay Divakaran, Thomas S. Huang:
Comparing MFCC and MPEG-7 audio features for feature extraction, maximum likelihood HMM and entropic prior HMM for sports audio classification. 397-400 - Ziyou Xiong, Regunathan Radhakrishnan, Ajay Divakaran, Thomas S. Huang:
Audio events detection based highlights extraction from baseball, golf and soccer games in a unified framework. 401-404 - Lie Lu, Yi Mao, Liu Wenyin, Hong-Jiang Zhang:
Audio restoration by constrained audio texture synthesis. 405-408 - Tetsuro Kitahara, Masataka Goto, Hiroshi G. Okuno:
Musical instrument identification based on F0-dependent multivariate normal distribution. 409-412 - Dreten De Koning, Werner Verhelst:
On psychoacoustic noise shaping for audio requantization. 413-416
Architecture, Implementation, and Design
- Nicolas Ventroux, Jean-François Nezan, Mickaël Raulet, Olivier Déforges:
Rapid prototyping for an optimized MPEG4 decoder implementation over a parallel heterogeneous architecture. 417-420 - Hung-Chi Fang, Tu-Chih Wang, Yu-Wei Chang, Liang-Gee Chen:
Hardware oriented rate control algorithm and implementation for realtime video coding. 421-424 - Ho-Man Tang, Michael R. Lyu, Irwin King:
Face recognition committee machine. 425-428 - Shantanu Chakrabartty, Masakazu Yagi, Tadashi Shibata, Gert Cauwenberghs:
Robust cephalometric landmark identification using support vector machines. 429-432 - Richard Kuehnel, Yuke Wang:
A method of generating uniformly distributed sequences over [0, K], where K+1 is not a power of two. 433-436 - Anand Krishnamurthy, Yiyan Tang, Cathy Xu, Yuke Wang:
An efficient implementation of multi-prime RSA on DSP processor. 437-440 - Donglai Xu, Rui Gao, Hadj Batatia:
An improved parallel architecture fro MPEG-4 motion estimation in 3G mobile applications. 441-444 - Toshiyuki Yamane, Yasunao Katayama:
An ultra-fast Reed-Solomon decoder soft-IP with 8-error correcting capability. 445-448
Multimedia Technology in Bioinformatics
- Zuyi Wang, Sun-Yuan Kung, Junying Zhang, Javed I. Khan, Jianhua Xuan, Yue Joseph Wang:
Computational intelligence approach for gene expression data mining and classification. 449-452 - Harry Hochheiser, Eric H. Baehrecke, Stephen M. Mount, Ben Shneiderman:
Dynamic querying for pattern identification in microarray and genomic data. 453-456 - Sophia R. He, Edmond J. Breen, Sybille M. N. Hunt:
Proteomics: approaches and image analysis tools for drug discovery. 457-460 - Jinwook Seo, Marina Bakay, Po Zhao, Yi-Wen Chen, Priscilla Clarkson, Ben Shneiderman, Eric P. Hoffman:
Interactive color mosaic and dendrogram displays for signal/noise optimization in microarray data analysis. 461-464 - Per B. Hojte, Xiaoxing Wang:
Registering electrophoresis images for bioinformatics study of protein. 465-468
Video Analysis and Mining
- Dong-Jun Lan, Yufei Ma, Hong-Jiang Zhang:
A novel motion-based representation for video mining. 469-472 - Belle L. Tseng, Ching-Yung Lin, DongQing Zhang, John R. Smith:
Improved text overlay detection in videos using a fusion-based classifier. 473-476 - Chih-Yi Chiu, Shih-Pin Chao, Jui-Hsiang Chao, Wen-Yen Chang, Hsin-Chih Lin, Shi-Nine Yang:
Motion indexing and synthesis. 477-480 - Cees G. M. Snoek, Marcel Worring:
Time interval maximum entropy based event indexing in soccer video. 481-484 - Li-Qun Xu, Yongmin Li:
Video classification using spatial-temporal features and PCA. 485-488
Multimedia Computing Systems and Appliances
- Ju Wang, Jonathan C. L. Liu, Yishu He:
Efficient buffering control for a software-only, high-level, high-profile, MPEG-2 decoder. 489-492 - Yan Zhu, Min-You Wu, Wei Shu:
Comparison study and evaluation of overlay multicast networks. 493-496 - Yoshitaka Nakamura, Hirozumi Yamaguchi, Akihito Hiromori, Keiichi Yasumoto, Teruo Higashino, Kenichi Taniguchi:
On designing end-user multicast for multiple video sources. 497-500 - Eugenio Costamagna, Lorenzo Favalli, Francesco Tarantola:
Characterization and modeling of campus-level IP network traffic. 501-504 - Stuart Goose, Rajanikanth Tanikella, Sreedhar Kodlahalli:
Attenuator: towards preserving the original appearance of large documents when rendered on small screen mobile devices. 505-508
Fast Algorithm for Video Processing
- Keman Yu, Jiangbo Lu, Jiang Li, Shipeng Li:
Practical real-time video codec for mobile devices. 509-512 - Hyungjoon Kim, Yucel Altunbasak:
Low-complexity rate-distortion optimal macroblock mode selection for MPEG-like video coders. 513-516 - Hye-Yeon C. Tourapis, Alexis M. Tourapis:
Fast motion estimation within the H.264 codec. 517-520 - Bojun Meng, Oscar C. Au, Chi-Wah Wong, Hong-Kwai Lam:
Efficient intra-prediction mode selection for 4×4 blocks in H.264. 521-524 - Jun Xin, Ming-Ting Sun, Vincent Hsu:
Diversity-based fast block motion estimation. 525-528
Multimedia Human-Machine Interface and Interaction
- Yao-Jen Chang, Chao-Kuei Hsieh, Pei-Wei Hsu, Yung-Chang Chen:
Speech-assisted facial expression analysis and synthesis for virtual conferencing systems. 529-532 - Ashish Verma, Nitendra Rajput, L. Venkata Subramaniam:
Using viseme based acoustic models for speech driven lip synthesis. 533-536 - Atsuo Yoshitaka, Hirokazu Seki:
Detecting auditory information in concentration based on eye movement. 537-540 - Martin Zobl, Michael Geiger, Björn W. Schuller, Manfred K. Lang, Gerhard Rigoll:
A real-time system for hand gesture controlled operation of in-car devices. 541-544 - Olivier Pietquin, Thierry Dutoit:
Aided design of finite-state dialogue management systems. 545-548 - Laurence Devillers, Lori Lamel, Ioana Vasilescu:
Emotion detection in task-oriented spoken dialogues. 549-552 - Nils Klarlund:
Editing by voice and the role of sequential symbol systems for improved human-to-computer information rates. 553-556 - Amarnag Subramanya, Raghunandan S. Kumaran, John N. Gowdy:
Real time eye tracking for human computer interfaces. 557-560 - Alper Kanak, Engin Erzin, Yucel Yemez, A. Murat Tekalp:
Joint audio-video processing for biometric speaker identification. 561-564 - Ying Li, Shrikanth S. Narayanan, C.-C. Jay Kuo:
Audiovisual-based adaptive speaker identification. 565-568
Algorithms and Architectures for Multimedia Communcations
- Sumit Roy, John Ankcorn, Susie J. Wee:
Architecture of a modular streaming media server for content delivery networks. 569-572 - Hideaki Ito, Teruo Fukumura:
A delivery method of videos with required minimum bandwidths. 573-576 - Shiang-Chun Liou, Hsuan-Chia Lu, Kuo-Hsien Yeh:
A capable location prediction and resource reservation scheme in wireless networks for multimedia. 577-580 - Yen-Chi Lee, Yucel Altunbasak, Russell M. Mersereau:
A drift-free motion-compensated predictive encoding technique for multiple description coding. 581-584 - Enrico Magli, Massimo Mancin, Luca Merello:
Low-complexity video compression for wireless sensor networks. 585-588 - Shuhua Peng, Xiaodong Liu, Qionghai Dai, Yu Cheng:
An improved RM algorithm for preventing streaming media tasks from starvation. 589-592 - Gaurav Harit, Santanu Chaudhury, Gaurav Garg, Pramod Kumar Sharma:
A framework for video representation and transcoding using appearance spaces. 593-596 - Andrea Cavallaro, Olivier Steiger, Touradj Ebrahimi:
Semantic segmentation and description for video transcoding. 597-600 - Tu-Chih Wang, Yu-Wen Huang, Hung-Chi Fang, Liang-Gee Chen:
Performance analysis of hardware oriented algorithm modification in H.264. 601-604
Speech Recognition and Enhancement
- Ashutosh Garg, Gerasimos Potamianos, Chalapathy Neti, Thomas S. Huang:
Frame-dependent multi-stream reliability indicators for audio-visual speech recognition. 605-608 - Hideki Banno, Tetsuya Shinde, Kazuya Takeda, Fumitada Itakura:
In-car speech recognition using distributed microphones: adapting to automatically detected driving conditions. 609-612 - LiFeng Sang, Zhaohui Wu, Yingchun Yang, Wanfeng Zhang:
Automatic speaker recognition using dynamic Bayesian network. 613-616 - Phu Chien Nguyen, Masato Akagi, Tu Bao Ho:
Temporal decomposition: a promising approach to VQ-based speaker identification. 617-620 - Guillaume Lathoud, Iain McCowan:
Location based speaker segmentation. 621-624 - Shoichi Matsunaga, Atsunori Ogawa, Yoshikazu Yamaguchi, Akihiro Imamura:
Non-native English speech recognition using bilingual English lexicon and acoustic models. 625-628 - Guangji Shi, Parham Aarabi:
Robust digit recognition using phase-dependent time-frequency masking. 629-632 - Jounghoon Beh, Hanseok Ko:
A novel spectral subtraction scheme for robust speech recognition: spectral subtraction using spectral harmonics of speech. 633-636
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.