Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
126 tokens/sec
GPT-4o
47 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Ashaar: Automatic Analysis and Generation of Arabic Poetry Using Deep Learning Approaches (2307.06218v1)

Published 12 Jul 2023 in cs.CL

Abstract: Poetry holds immense significance within the cultural and traditional fabric of any nation. It serves as a vehicle for poets to articulate their emotions, preserve customs, and convey the essence of their culture. Arabic poetry is no exception, having played a cherished role in the heritage of the Arabic community throughout history and maintaining its relevance in the present era. Typically, comprehending Arabic poetry necessitates the expertise of a linguist who can analyze its content and assess its quality. This paper presents the introduction of a framework called \textit{Ashaar} https://github.com/ARBML/Ashaar, which encompasses a collection of datasets and pre-trained models designed specifically for the analysis and generation of Arabic poetry. The pipeline established within our proposed approach encompasses various aspects of poetry, such as meter, theme, and era classification. It also incorporates automatic poetry diacritization, enabling more intricate analyses like automated extraction of the \textit{Arudi} style. Additionally, we explore the feasibility of generating conditional poetry through the pre-training of a character-based GPT model. Furthermore, as part of this endeavor, we provide four datasets: one for poetry generation, another for diacritization, and two for Arudi-style prediction. These datasets aim to facilitate research and development in the field of Arabic poetry by enabling researchers and enthusiasts to delve into the nuances of this rich literary tradition.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (49)
  1. Ahmedabdel-aal/arabic-poem-generator: A pytorch rnn model to generate arabic poems. https://github.com/AhmedAbdel-Aal/Arabic-poem-Generator. (Accessed on 03/17/2023).
  2. Gheith Abandah and Asma Abdel-Karim. 2020. Accurate and fast recurrent neural network solution for the automatic diacritization of arabic text. Jordanian Journal of Computers and Information Technology, 6(2).
  3. Classifying and diacritizing arabic poems using deep recurrent neural networks. Journal of King Saud University-Computer and Information Sciences.
  4. Classification of arabic poems: from the 5th to the 5th century. In International Conference on Image Analysis and Processing, pages 179–186. Springer.
  5. Omar Abboushi and Mohammad Azzeh. 2023. Toward fluent arabic poem generation based on fine-tuning aragpt2 transformer. Arabian Journal for Science and Engineering, pages 1–13.
  6. Belal Abuata and Asma Al-Omari. 2018. A rule-based algorithm for the detection of arud meter in classical arabic poetry. International Arab Journal of Information Technology, 15(4):1–5.
  7. Authorship attribution in arabic poetry using nb, svm, smo. In 2016 11th International Conference on Intelligent Systems: Theories and Applications (SITA), pages 1–5. IEEE.
  8. Munef Abdullah Ahmed and Stefan Trausan-Matu. 2017. A program for analyzing classical arabic poetry for teaching purposes. Rom. J. Hum.-Comput. Interact, 10(4):331–344.
  9. Authorship attribution in arabic poetry’context using markov chain classifier.
  10. Ensemble methods for instance-based arabic language authorship attribution. IEEE Access, 8:17331–17345.
  11. Meter classification of arabic poems using deep bidirectional recurrent neural networks. Pattern Recognition Letters, 136:1–7.
  12. Metrec: A dataset for meter classification of arabic poetry. Data in Brief, 33:106497.
  13. Basrah: an automatic system to identify the meter of arabic poetry. Natural Language Engineering, 20(1):131–149.
  14. Mohammad M Albaddawi and Gheith A Abandah. 2021. Pattern and poet recognition of arabic poems using bilstm networks. In 2021 IEEE Jordan International Joint Conference on Electrical Engineering and Information Technology (JEEIT), pages 72–77. IEEE.
  15. Abdulrahman Almuhareb. 2016. Arabic poetry focused crawling using svm and keywords. In 2016 4th Saudi International Conference on Information Technology (Big Data Analysis)(KACSTIT), pages 1–4. IEEE.
  16. Recognition of classical arabic poems. In Proceedings of the Workshop on Computational Linguistics for Literature, pages 9–16.
  17. Recognition of modern arabic poems. J. Softw., 10(4):454–464.
  18. Finding arabic poem meter using context free grammar.
  19. Arabic poetry meter categorization using machine learning based on customized feature extraction. In 2021 International Conference on Intelligent Technology, System and Service for Internet of Everything (ITSS-IoE), pages 1–4. IEEE.
  20. Emotion classification in arabic poetry using machine learning. International Journal of Computer Applications, 65(16).
  21. Arabic authorship attribution: An extensive study on twitter posts. ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), 18(1):1–51.
  22. Alaa Saleh Altheneyan and Mohamed El Bachir Menai. 2014. Naïve bayes classifiers for authorship attribution of arabic texts. Journal of King Saud University-Computer and Information Sciences, 26(4):473–484.
  23. Aragpt2: Pre-trained transformer for arabic language generation. arXiv preprint arXiv:2012.15520.
  24. Mohamed El Ghaly Beheitt and Moez Ben Haj Hmida. 2022. Automatic arabic poem generation with gpt-2. In ICAART (2), pages 366–374.
  25. Pattern matching in meter detection of arabic classical poetry. In 2020 IEEE/ACS 17th International Conference on Computer Systems and Applications (AICCSA), pages 1–8. IEEE.
  26. Enriching word vectors with subword information. Transactions of the association for computational linguistics, 5:135–146.
  27. Ibrahim Abu El-Khair. 2016. 1.5 billion words arabic corpus. arXiv preprint arXiv:1611.04033.
  28. Generating classical arabic poetry using pre-trained models. In Proceedings of the The Seventh Arabic Natural Language Processing Workshop (WANLP), pages 53–62.
  29. Arabic text diacritization using deep neural networks. In 2019 2nd international conference on computer applications & information security (ICCAIS), pages 1–7. IEEE.
  30. Discovering the applicability of classification algorithms with arabic poetry. In 2019 IEEE Jordan International Joint Conference on Electrical Engineering and Information Technology (JEEIT), pages 453–458. IEEE.
  31. Authorship attribution of arabic articles. In International Conference on Arabic Language Processing, pages 194–208. Springer.
  32. Expert system for testing the harmony of arabic poetry. Journal of Engineering Sciences, 1:401–411.
  33. E Khan. 2014. Using arabic poetry system for steganography. Asian Journal of Computer Science and Information Technology, 4(6):55–61.
  34. Mohammad S Khorsheed and Abdulmohsen O Al-Thubaity. 2013. Comparative evaluation of text classification techniques using a large diverse arabic dataset. Language resources and evaluation, 47(2):513–538.
  35. Mokthar Ali Hasan Madhfar and Ali Mustafa Qamar. 2020. Effective deep learning models for automatic diacritization of arabic text. IEEE Access, 9:273–288.
  36. Joan Mathilde Maling. 1973. The theory of classical Arabic metrics. Ph.D. thesis, Massachusetts Institute of Technology.
  37. Hashim Saleh Manna and Zamri Arifin. 2021. Metrics in arabic poetry.
  38. IA Mohammad. 2009. Naive bayes for classical arabic poetry classification. Al-Nahrain Journal of Science, 12(4):217–225.
  39. Ahmed Ibrahim Ahmed Omer and Michael Philip Oakes. 2017. Arud, the metrical system of arabic poetry, as a feature set for authorship attribution. In 2017 IEEE/ACS 14th International Conference on Computer Systems and Applications (AICCSA), pages 431–436. IEEE.
  40. Classical arabic poetry: Classification based on era. In 2020 IEEE/ACS 17th International Conference on Computer Systems and Applications (AICCSA), pages 1–6. IEEE.
  41. Bruno Paoli. 2001. Meters and formulas: The case of ancient arabic poetry. Belgian journal of linguistics, 15(1):113–136.
  42. Leveraging pre-trained checkpoints for sequence generation tasks. Transactions of the Association for Computational Linguistics, 8:264–280.
  43. Al-Zahrani Abdul Kareem Saleh and Moustafa Elshafei. 2012. Arabic poetry meter identification system and method. US Patent 8,219,386.
  44. Sameerah Talafha and Banafsheh Rekabdar. 2019. Arabic poem generation with hierarchical recurrent attentional network. In 2019 IEEE 13th International Conference on Semantic Computing (ICSC), pages 316–323. IEEE.
  45. Ben Wang and Aran Komatsuzaki. 2021. GPT-J-6B: A 6 Billion Parameter Autoregressive Language Model. https://github.com/kingoflolz/mesh-transformer-jax.
  46. Learning meters of arabic and english poems with recurrent neural networks: a step forward for language understanding and synthesis. arXiv preprint arXiv:1905.05700.
  47. Taha Zerrouki and Amar Balla. 2017. Tashkeela: Novel corpus of arabic vocalized texts, data for auto-diacritization systems. Data in brief, 11:147–151.
  48. A proposed system for the identification of modem arabic poetry meters (imap). In 2020 15th International Conference on Computer Engineering and Systems (ICCES), pages 1–5. IEEE.
  49. Mukhlisa Ziyovuddinova. 2021. Arud system in view of metric theory. The American Journal of Applied sciences, 3(05):234–239.
Citations (1)

Summary

We haven't generated a summary for this paper yet.

Github Logo Streamline Icon: https://streamlinehq.com