AfriHG: News headline generation for African Languages
Toyib Ogunremi, Serah Akojenu, Anthony Soronnadi, Olubayo Adekanmbi, David Ifeoluwa Adelani
2024-12-28
Headline Generation
Abstract
This paper introduces AfriHG, a news headline generation dataset created by combining the XLSum and MasakhaNEWS datasets, focusing on 16 languages widely spoken in Africa. We experimented with two seq2seq models (mT5-base and AfriTeVa V2) and the Aya-101 LLM. Our results show that Africa-centric seq2seq models such as AfriTeVa V2 outperform the massively multilingual mT5-base model. Finally, we show that fine-tuning AfriTeVa V2, with 313M parameters, is competitive with prompting the Aya-101 LLM, which has more than 13B parameters.
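To make the idea of combining two news corpora into one headline generation dataset concrete, here is a minimal sketch in plain Python. It is not the authors' pipeline. The field names (`title`, `headline`, `text`), the helper names `to_pair` and `combine`, the toy records, and the filtering and de-duplication steps are all assumptions made for illustration.

```python
from typing import Dict, Iterable, List

Pair = Dict[str, str]


def to_pair(record: Dict[str, str], text_key: str, headline_key: str,
            lang: str, source: str) -> Pair:
    """Map a source-specific record onto a shared (text, headline) schema.

    The key names passed in are hypothetical; real corpora may differ.
    """
    return {
        "lang": lang,
        "source": source,
        "text": record[text_key].strip(),
        "headline": record[headline_key].strip(),
    }


def combine(*groups: Iterable[Pair]) -> List[Pair]:
    """Merge groups of pairs, skipping empty fields and exact duplicates.

    Filtering and de-duplication are assumptions of this sketch.
    """
    seen = set()
    merged: List[Pair] = []
    for group in groups:
        for pair in group:
            key = (pair["lang"], pair["headline"], pair["text"])
            if pair["text"] and pair["headline"] and key not in seen:
                seen.add(key)
                merged.append(pair)
    return merged


# Toy stand-ins for records from two corpora (invented for illustration).
xlsum_like = [
    {"title": "Rain floods market",
     "text": "Heavy rain flooded the central market on Monday."},
    {"title": "", "text": "An article with no title."},
]
masakhanews_like = [
    {"headline": "Rain floods market",
     "text": "Heavy rain flooded the central market on Monday."},
    {"headline": "Team wins cup final",
     "text": "The national team won the cup final."},
]

merged = combine(
    (to_pair(r, "text", "title", "hau", "xlsum") for r in xlsum_like),
    (to_pair(r, "text", "headline", "hau", "masakhanews")
     for r in masakhanews_like),
)
```

Each entry in `merged` then holds an article body as model input and its headline as the target, which is the shape a seq2seq fine-tuning setup or a prompting setup would consume.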
Related Papers
Panoramic Interests: Stylistic-Content Aware Personalized Headline Generation (2025-01-21)
Fact-Preserved Personalized News Headline Generation (2025-01-21)
ComMer: a Framework for Compressing and Merging User Data for Personalization (2025-01-05)
BeliN: A Novel Corpus for Bengali Religious News Headline Generation using Contextual Feature Fusion (2025-01-02)
Abstractive Summarization of Low resourced Nepali language using Multilingual Transformers (2024-09-29)
Creating Arabic LLM Prompts at Scale (2024-08-12)
Multilingual Fine-Grained News Headline Hallucination Detection (2024-07-22)
XL-HeadTags: Leveraging Multimodal Retrieval Augmentation for the Multilingual Generation of News Headlines and Tags (2024-06-06)