Implementation of neural networks for classifying information sources in digital journalism on artificial intelligence

Fred Torres-Cruz; Yudi Janeh Yucra-Mamani; Walker Ernesto Aragón Cruz; Mariana Esther Tovar Yucra

doi:10.31637/epsir-2025-1436

Authors

Fred Torres-Cruz, Universidad Nacional del Altiplano, https://orcid.org/0000-0003-0834-6834
Yudi Janeh Yucra-Mamani, Universidad Nacional del Altiplano, https://orcid.org/0000-0002-9483-7949
Walker Ernesto Aragón Cruz, Universidad Nacional del Altiplano, https://orcid.org/0000-0002-0139-2961
Mariana Esther Tovar Yucra, Catholic University of Santa María, https://orcid.org/0009-0002-7522-3826

Keywords:

Classification, Informative Sources, Artificial Intelligence, Journalism, Neural Networks, SHAP, LIME

Abstract

Introduction: In the digital era, classifying information sources is crucial for maintaining the quality of journalism, particularly in coverage of artificial intelligence (AI). This study uses neural networks for this task, evaluating their effectiveness and providing clear interpretations of the results.

Methodology: A dataset with 14 characteristics of journalistic content was used, including genre, publication section, source type, and multimedia presence. The target variable classified the primary source of each text into categories such as expert, political, cultural, religious, journalistic, and others. The neural network had two hidden dense layers of 64 neurons each with ReLU activation. It was trained and evaluated on data split into training and testing sets. Standardizing the features improved performance, and the model reached 46% accuracy on the test set.

Results: SHAP and LIME were applied to interpret the model’s predictions. SHAP identified the most influential features, while LIME showed in detail how specific features affect individual predictions.

Conclusions: This study proposes an innovative approach to classifying information sources in digital journalism and highlights the importance of interpretability in AI models.
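The architecture described in the abstract can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions, not the authors' implementation: the abstract gives the 14 input features, the two 64-neuron hidden layers, the ReLU activation, and the feature standardization, while the number of classes (6), the softmax output layer, the weight initialization, and every function name below are assumptions made for the example. The weights are random and untrained, and the input rows are synthetic placeholders rather than the study's data.

```python
import math
import random

N_FEATURES = 14  # from the abstract: 14 characteristics of journalistic content
HIDDEN = 64      # from the abstract: two hidden dense layers of 64 neurons each
N_CLASSES = 6    # assumption: expert, political, cultural, religious, journalistic, other


def standardize(rows):
    """Rescale every column to zero mean and unit variance (z-scores)."""
    n = len(rows)
    cols = list(zip(*rows))
    means = [sum(c) / n for c in cols]
    # A constant column has zero spread; fall back to 1.0 to avoid dividing by zero.
    stds = [math.sqrt(sum((v - m) ** 2 for v in c) / n) or 1.0
            for c, m in zip(cols, means)]
    return [[(v - m) / s for v, m, s in zip(row, means, stds)] for row in rows]


def init_layer(n_in, n_out, rng):
    """Random weights (n_out x n_in) and zero biases: untrained placeholders."""
    scale = math.sqrt(2.0 / n_in)  # assumption: He-style scaling, common with ReLU
    weights = [[rng.gauss(0.0, scale) for _ in range(n_in)] for _ in range(n_out)]
    return weights, [0.0] * n_out


def dense(x, layer, relu=True):
    """Fully connected layer: weighted sum plus bias, optionally followed by ReLU."""
    weights, biases = layer
    out = [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, biases)]
    return [max(0.0, o) for o in out] if relu else out


def softmax(logits):
    """Turn raw scores into class probabilities (assumed output layer)."""
    top = max(logits)  # subtract the maximum for numerical stability
    exps = [math.exp(z - top) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def predict_proba(x, layers):
    """Forward pass: 14 inputs -> 64 ReLU -> 64 ReLU -> class probabilities."""
    h = dense(x, layers[0])
    h = dense(h, layers[1])
    return softmax(dense(h, layers[2], relu=False))


rng = random.Random(0)
layers = [init_layer(N_FEATURES, HIDDEN, rng),
          init_layer(HIDDEN, HIDDEN, rng),
          init_layer(HIDDEN, N_CLASSES, rng)]

# Synthetic stand-in rows; the study's dataset is not reproduced here.
data = [[rng.random() for _ in range(N_FEATURES)] for _ in range(20)]
scaled = standardize(data)
probs = predict_proba(scaled[0], layers)
predicted_class = probs.index(max(probs))
```

In practice the weights would be fitted on the training split before measuring accuracy on the test split, and a library with built-in dense layers and optimizers would normally replace the hand-written forward pass; the sketch only makes the layer sizes and the standardization step concrete.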

Author Biographies

Fred Torres-Cruz, Universidad Nacional del Altiplano

Statistical and Computer Engineer with a Master's degree in Systems Engineering. He is a PhD student in Computer Science at the Universidad Nacional del Altiplano (UNAP) and a member of the Computer Science Research Institute. He currently teaches at the Professional School of Statistical Engineering and Computer Science at UNAP, and teaches undergraduate and graduate courses at various universities. RENACYT researcher.

Yudi Janeh Yucra-Mamani, Universidad Nacional del Altiplano

Holds a degree in Social Communication Sciences from the Universidad Nacional del Altiplano Puno (UNAP), a Master Scientiae in Social Sciences with a specialization in Communication for Development, and a Doctoris Scientiae in Social Sciences. Principal professor at the Professional School of Social Communication Sciences of the UNAP, teaching undergraduate and postgraduate courses. RENACYT researcher and Director of the Institute of Social and Business Research (IDISEM), with professional experience in journalism and public relations.

Walker Ernesto Aragón Cruz, Universidad Nacional del Altiplano

He holds a degree in Social Communication Sciences from the Universidad Nacional del Altiplano Puno (UNAP), a Master's degree in Sciences with a specialization in Communication, and a Doctoris Scientiae in Social Sciences. He is currently an assistant professor at the Professional School of Social Communication Sciences of the UNAP, where he teaches undergraduate and graduate courses. He is a RENACYT researcher and a member of the board of directors of the Institute of Social and Business Research (IDISEM). He has professional experience in communication for development and media production.

Mariana Esther Tovar Yucra, Catholic University of Santa María

Psychology student at the Catholic University of Santa María (Arequipa). She has spoken at national and international scientific congresses and has published on communication, gender, and transmedia. She has completed internships in public and private educational institutions in Arequipa. In the organizational field, she has focused on human resources and personnel management.
