A Hybrid Deep Hashing and Metric Space Partitioning Framework for Scalable Content-Based Image Retrieval via Unsupervised Representation Learning and VP-Tree Optimization

Mohamadzadeh, Sajad; Gharehbagh, Mohammad

doi:10.22061/jecei.2025.11879.839

Document Type: Original Research Paper

Authors

Department of Electrical and Computer Engineering, University of Birjand, Birjand, Iran.

Abstract

Background and Objectives: Content-Based Image Retrieval (CBIR) systems are crucial for managing the exponential growth of digital imagery. Traditional methods that rely on handcrafted features often fail to scale and to capture semantic content. Although deep learning improves retrieval quality, computational complexity and efficiency remain challenging. This paper introduces a hybrid CBIR framework that combines unsupervised deep feature learning, adaptive hashing, and VP-Tree-based hierarchical search optimization. The proposed system, evaluated on CIFAR-10, an ImageNet subset, and a custom medical imaging dataset, achieves a mean average precision (mAP) of 96.1% and reduces retrieval latency by approximately 40% compared to conventional methods. By leveraging autoencoder-driven latent feature extraction and scalable metric space partitioning, the framework demonstrates superior scalability, retrieval speed, and accuracy for large-scale applications.
Methods: The proposed framework employs autoencoder-driven latent space encoding to extract compact yet semantically rich feature representations that remain discriminative across diverse image categories. To enhance retrieval efficiency, a hybrid search mechanism is implemented: a Euclidean-distance nearest-neighbor scheme with O(N log N) complexity is used for moderate-scale datasets, while a VP-Tree-based hashing scheme with O(log N) search complexity is applied for large-scale retrieval scenarios. By leveraging hierarchical metric space partitioning, the method significantly reduces search complexity while maintaining retrieval accuracy.
Results: Extensive evaluations show the proposed framework outperforms traditional and modern deep hashing techniques, achieving higher mean average precision, lower search latency, and better storage efficiency for both moderate and large-scale datasets. By integrating unsupervised representation learning, advanced hashing, and optimized search structures, the system surpasses conventional methods in speed and precision.
Conclusion: This study presents a highly scalable and computationally efficient CBIR framework that addresses the limitations of existing methods by combining unsupervised deep feature learning, adaptive hashing, and hierarchical search structures. The results highlight the framework's ability to achieve high retrieval accuracy and efficiency, making it suitable for real-time applications in large-scale multimedia repositories.
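
The VP-Tree search mentioned in the Methods paragraph can be illustrated with a minimal sketch. It is not the authors' implementation: the class name `VPTree`, the helper `euclidean`, the random choice of vantage point, and the use of plain tuples as stand-ins for autoencoder latent codes are all assumptions made for illustration. The sketch builds a vantage-point tree by splitting points at the median distance from a vantage point, then answers a nearest-neighbor query by pruning subtrees with the triangle inequality.

```python
import math
import random


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


class VPTree:
    """Illustrative vantage-point tree (hypothetical helper, not from the paper)."""

    def __init__(self, points, dist=euclidean, seed=0):
        self.dist = dist
        self.root = self._build(list(points), random.Random(seed))

    def _build(self, points, rng):
        if not points:
            return None
        # Pick a random vantage point and remove it from the working set.
        i = rng.randrange(len(points))
        points[i], points[-1] = points[-1], points[i]
        vp = points.pop()
        if not points:
            return (vp, 0.0, None, None)
        # Split the remaining points at the median distance to the vantage point.
        dists = [self.dist(vp, p) for p in points]
        mu = sorted(dists)[len(dists) // 2]
        inner = [p for p, d in zip(points, dists) if d < mu]
        outer = [p for p, d in zip(points, dists) if d >= mu]
        return (vp, mu, self._build(inner, rng), self._build(outer, rng))

    def nearest(self, query):
        """Return (point, distance) of the stored point closest to query."""
        best = [None, math.inf]

        def search(node):
            if node is None:
                return
            vp, mu, inner, outer = node
            d = self.dist(query, vp)
            if d < best[1]:
                best[0], best[1] = vp, d
            if d < mu:
                # Query lies inside the ball: search inside first, then
                # visit the outside only if the search radius crosses mu.
                search(inner)
                if d + best[1] >= mu:
                    search(outer)
            else:
                search(outer)
                if d - best[1] < mu:
                    search(inner)

        search(self.root)
        return best[0], best[1]


if __name__ == "__main__":
    rng = random.Random(42)
    # Random 8-dimensional vectors standing in for latent feature codes.
    database = [tuple(rng.random() for _ in range(8)) for _ in range(1000)]
    tree = VPTree(database)
    query = tuple(rng.random() for _ in range(8))
    match, distance = tree.nearest(query)
    print("nearest distance:", distance)
```

The pruning step is what gives the structure its advantage over a linear scan: a subtree is skipped whenever the triangle inequality shows it cannot contain a closer point. How much is pruned in practice depends on the data distribution and the dimensionality of the features, so the logarithmic behavior should be read as the favorable case for a balanced tree rather than a guarantee.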

Main Subjects

Image Annotation and Retrieval

Open Access

This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit: http://creativecommons.org/licenses/by/4.0/

Publisher’s Note

JECEI Publisher remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Publisher

Shahid Rajaee Teacher Training University

Journal of Electrical and Computer Engineering Innovations (JECEI)

Volume 14, Issue 2
July 2026
Pages 337-350
