Constructing a five-dimensional framework for traditional village building protection in China: A BERTopic bibliometric analysis linked to rural tourism
Protecting traditional village buildings is a global challenge because it bears on both cultural identity and the sustainable development of rural tourism. Establishing a unified protection system has therefore become an important objective in the field of cultural heritage. This study applies a bibliometric approach based on the BERTopic model to 602 academic papers published between 1999 and 2024 and indexed in the China National Knowledge Infrastructure and Web of Science databases. The analysis provides quantitative evidence for the integrated development of traditional building protection and rural tourism. Topic evolution analysis shows that research on traditional village buildings has moved from building culture to material technology, then to functional adaptation and community participation, and finally to digital transformation. This trajectory reflects a shift from static preservation to dynamic regeneration and from a focus on physical space to a focus on social relationships. Using topic similarity analysis and semantic clustering, the nine original topics were consolidated into five main dimensions (building features, materials and construction, spatial and functional adaptability, building culture, and social culture), which together form a five-dimensional framework for evaluating building protection. The framework offers theoretical depth and has potential for policy use. The study advances beyond traditional qualitative approaches by proposing a quantitative workflow that combines semantic modeling, topic evolution analysis, and framework construction, and it offers structured, visualized data to support theoretical understanding and policy development in cultural heritage protection and rural tourism.
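The consolidation step, in which nine topics are grouped into five dimensions by topic similarity, can be illustrated with a minimal sketch. This is not the study's implementation. It assumes each topic is represented by a vector (here, made-up five-element toy vectors rather than real BERTopic topic embeddings) and repeatedly merges the two most similar groups, using average-linkage cosine similarity, until five groups remain. The function names `cosine` and `merge_topics` and all numeric values are hypothetical.

```python
import math
from itertools import combinations


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm


def merge_topics(vectors, n_groups):
    """Average-linkage agglomerative grouping on cosine similarity.

    Starts with one group per topic and merges the most similar pair
    of groups until only n_groups remain. Returns lists of topic indices.
    """
    clusters = [[i] for i in range(len(vectors))]
    while len(clusters) > n_groups:
        best_pair, best_sim = None, -2.0
        for a, b in combinations(range(len(clusters)), 2):
            sims = [cosine(vectors[i], vectors[j])
                    for i in clusters[a] for j in clusters[b]]
            avg = sum(sims) / len(sims)
            if avg > best_sim:
                best_pair, best_sim = (a, b), avg
        a, b = best_pair
        clusters[a] = sorted(clusters[a] + clusters[b])
        del clusters[b]
    return clusters


# Hypothetical toy vectors for nine topics (assumption: illustrative values only).
topic_vectors = [
    [1.0, 0.1, 0.0, 0.0, 0.0],
    [0.9, 0.2, 0.0, 0.0, 0.0],
    [0.0, 1.0, 0.1, 0.0, 0.0],
    [0.0, 0.9, 0.2, 0.0, 0.0],
    [0.0, 0.0, 1.0, 0.1, 0.0],
    [0.0, 0.0, 0.9, 0.2, 0.0],
    [0.0, 0.0, 0.0, 1.0, 0.1],
    [0.0, 0.0, 0.0, 0.9, 0.2],
    [0.0, 0.0, 0.0, 0.0, 1.0],
]

groups = merge_topics(topic_vectors, n_groups=5)
print(groups)  # [[0, 1], [2, 3], [4, 5], [6, 7], [8]]
```

In practice the vectors would come from the fitted topic model, and the number of groups would be chosen by inspecting the similarity structure instead of being fixed in advance as it is here.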
