Search results
Oct 5, 2010 · Riyad-us Saliheen (translation: Gardens of the Righteous) is a Compilation of verses from the Qur'an and hadith by Shaykh Abu Zakaria Mohiuddin Yahya Ibn Sharaf al-Nawawi (1234â1278). It contains approximately 1900 hadiths mainly from the Six major Hadith collections.
Feb 23, 2023 · Addeddate 2023-02-23 07:28:43 Identifier riyad-us-saliheen-urdu Identifier-ark ark:/13960/s2nnsv18w31 Ocr tesseract 5.3.0-1-gd3a4
Mar 17, 2016 · riyaz ul quran (urdu translation of quran by maulana yunus sahab palanpuri d.b.)
Riaz-Us-Saliheen Vol-1 With Urdu Translation. - Free ebook download as PDF File (.pdf), Text File (.txt) or read book online for free. Riaz-Us-Saliheen Vol-1 With Urdu Translation.
- (62)
- Overview
- Navigating this repository
- Contribution
- License / Copyright
An Urdu text corpus to enable research and applications for the Urdu language. We believe Maḵẖzan is the best Urdu dataset to start work for Urdu NLP.
This dataset currently comprises 6.26 million words of Urdu text. We have selected source text that we believe to have gone through strong editorial standards, to preserve linguistic integrity. The text is then syntactically marked up, so that headings, paragraphs, and lists can be identified. Metadata is added to each file so data can be intelligently filtered and selected. We annotate non-Urdu text included in source publications. Data also goes through an intense cleaning process to make the text easier to read for software, as well as correcting typograghical errors.
•/docs: Documentation
•/scripts: Scripts to analyze the text, constructed as a Swift package.
•/stats: Output of text analyses, which can be used out of the box to power NLP applications, such as word and n-gram frequencies.
•/text: The text corpus itself, consisting of XML files.
Material in the /text directory
All files in the /text directory are covered under standard copyright. Each piece of text has been included in this repository with explicity permission of respective copyright holders, who are identified in the tag for each file. You are free to use this text for analysis, research and development, but you are not allowed to redistribute or republish this text. Where possible we encourage that forks of this repository be kept private unless explicit permission is granted. Some cases where a less restrictive license could apply to files in the /text directory are presented below. In some cases copyright free text has been digitally reproduced through the hard work of our collaborators. In such cases we have credited the appropriate people where possible in a field in the file's metadata, and we strongly encourage you to contact them before redistributing this text in any form. Where a separate license is provided along with the text, we have provided corresponding data in the field in a file's metadata.
All other materials
All other materials in this repository (such as software, aggregated analyses and documentation) in the /scripts or /stats directory are licensed under the terms of the MIT license.
Copyright concerns
If you feel any material has been included in this repository erroneously and/or copyright arrangements have not been respected, please file an issue on this repository or get in touch through our website.
Aug 24, 2020 · Maḵẖzan is an open-sourced Urdu text corpus. As we built the autocorrect technology that powers Matnsāz, we found that existing Urdu text corpuses were limited. Instead of building a new proprietary text corpus, or relying on workarounds to bad data, we decided to create a high-quality Urdu data source and share it.
6 days ago · Chad Griffiths Energy co-founder Naeem Tyab delays own FCPA sentencing hearing. The head of the defunct Canadian junior asked a DC court to postpone the hearing to give him time to gather more documents about his acquisition of two Doba Basin oil permits in 2011.