Shot detection in HEVC/H.265 video

Document Type: Original Article


1 Department of Computer, Shiraz Branch, Islamic Azad University, Shiraz, Iran

2 Department of Computer Engineering, Shiraz Branch, Islamic Azad University, Shiraz, Iran

3 School of Electrical and Computer Engineering, University of Tehran, North Kargar St., Tehran, Iran


The sheer number and volume of videos, and their widespread use, mean that most video is stored and transmitted in compressed form. Indexing, searching, and retrieving video directly in the compressed domain has therefore attracted great attention. The first step in video indexing and retrieval is segmenting a video into chronologically ordered, manageable pieces, or shots. In recent years a new video coding standard, H.265, has been introduced, and methods and algorithms are needed for analyzing, indexing, and retrieving H.265 compressed video. In this paper, a novel method is proposed for shot detection in H.265 compressed video without full decompression. The proposed method applies a threshold to the macro block coding information carried in the headers of the compressed H.265 bit-stream to detect shots. The results indicate that the proposed method detects shots in H.265 video with 80% accuracy and 84.3% recall.
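The thresholding idea described above can be illustrated with a minimal sketch. The abstract does not say which header field is used or what threshold value is chosen, so everything here is an assumption made for illustration: the per-frame feature is taken to be the fraction of intra-coded blocks (`intra_ratio`), the threshold of 0.6 is arbitrary, and the function names `detect_shot_boundaries` and `precision_recall` are hypothetical. The scoring helper computes precision and recall, a common pair for this task, which may differ from the accuracy measure reported in the paper.

```python
from typing import List, Sequence, Tuple


def detect_shot_boundaries(intra_ratio: Sequence[float],
                           threshold: float = 0.6) -> List[int]:
    """Flag frames whose per-frame feature exceeds a fixed threshold.

    intra_ratio[i] is ASSUMED to be the fraction of blocks in frame i that
    are intra-coded, as read from the bit-stream headers. The paper's
    actual feature and threshold are not given in the abstract.
    """
    return [i for i, ratio in enumerate(intra_ratio) if ratio > threshold]


def precision_recall(detected: Sequence[int],
                     truth: Sequence[int]) -> Tuple[float, float]:
    """Score detected boundary frames against ground-truth boundary frames."""
    detected_set, truth_set = set(detected), set(truth)
    true_positives = len(detected_set & truth_set)
    precision = true_positives / len(detected_set) if detected_set else 0.0
    recall = true_positives / len(truth_set) if truth_set else 0.0
    return precision, recall


if __name__ == "__main__":
    # Made-up feature values for seven frames; frames 2 and 5 spike.
    ratios = [0.05, 0.10, 0.90, 0.08, 0.07, 0.85, 0.10]
    boundaries = detect_shot_boundaries(ratios)
    print(boundaries)
    print(precision_recall(boundaries, truth=[2, 5]))
```

A fixed global threshold is the simplest choice; whether the paper uses a fixed or adaptive threshold cannot be determined from the abstract.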

