TungNT (Blue)

tungnt.blue@gmail.com

User Tools

Site Tools


development:python:ocr

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
development:python:ocr [2024/08/30 23:04] – [Tessereact] tungntdevelopment:python:ocr [2024/08/31 01:47] (current) – [Tessereact] tungnt
Line 48: Line 48:
 </code> </code>
  
-https://tesseract-ocr.github.io/tessdoc/Data-Files-in-different-versions.html+**Language:** 
 +  * https://tesseract-ocr.github.io/tessdoc/Data-Files-in-different-versions.html 
 +  * https://www.kaggle.com/code/dhorvay/pytesseract-multiple-languages
  
 +<code python>
 +print(pytesseract.get_languages(config='.'))
 +
 +['afr', 'amh', 'ara', 'asm', 'aze', 'aze_cyrl', 'bel', 'ben', 'bod', 'bos', 'bre', 'bul', 'cat', 'ceb', 'ces', 'chi_sim', 'chi_sim_vert', 'chi_tra', 'chi_tra_vert', 'chr', 'cos', 'cym', 'dan', 'deu', 'div', 'dzo', 'ell', 'eng', 'enm', 'epo', 'equ', 'est', 'eus', 'fao', 'fas', 'fil', 'fin', 'fra', 'frk', 'frm', 'fry', 'gla', 'gle', 'glg', 'grc', 'guj', 'hat', 'heb', 'hin', 'hrv', 'hun', 'hye', 'iku', 'ind', 'isl', 'ita', 'ita_old', 'jav', 'jpn', 'jpn_vert', 'kan', 'kat', 'kat_old', 'kaz', 'khm', 'kir', 'kmr', 'kor', 'kor_vert', 'lao', 'lat', 'lav', 'lit', 'ltz', 'mal', 'mar', 'mkd', 'mlt', 'mon', 'mri', 'msa', 'mya', 'nep', 'nld', 'nor', 'oci', 'ori', 'osd', 'pan', 'pol', 'por', 'pus', 'que', 'ron', 'rus', 'san', 'sin', 'slk', 'slv', 'snd', 'snum', 'spa', 'spa_old', 'sqi', 'srp', 'srp_latn', 'sun', 'swa', 'swe', 'syr', 'tam', 'tat', 'tel', 'tgk', 'tha', 'tir', 'ton', 'tur', 'uig', 'ukr', 'urd', 'uzb', 'uzb_cyrl', 'vie', 'yid', 'yor']
 +
 +from langdetect import detect_langs 
 +
 +detect_langs(ocr_result_original)
 +
 +tesseract --list-langs
 +</code>
 ====== How to Open an Image in Python with PIL (Pillow) ====== ====== How to Open an Image in Python with PIL (Pillow) ======
  
Line 548: Line 561:
   * https://www.affinda.com/tech-ai/how-to-convert-image-to-text-using-python   * https://www.affinda.com/tech-ai/how-to-convert-image-to-text-using-python
   * https://tesseract-ocr.github.io/tessdoc/Compiling.html#macos   * https://tesseract-ocr.github.io/tessdoc/Compiling.html#macos
 +  * https://nanonets.com/blog/ocr-with-tesseract/
development/python/ocr.1725059057.txt.gz · Last modified: 2024/08/30 23:04 by tungnt

Donate Powered by PHP Valid HTML5 Valid CSS Driven by DokuWiki