Rayrun

Tesseract JS | Extract Text From Image | Playwright Tutorial Part 66

Let's see how to extract text from an image and learn how to scrape mobile numbers from the Justdial webpage. It's easy and fun.

In this video, the content creator covers two concepts. The first is extracting text from an image, that is, converting an image into text. The second is handling pop-ups on a website, specifically justdial.com: the content creator walks through code that waits for a pop-up to appear and then clicks to close it.
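The wait-then-close pattern for pop-ups can be sketched roughly as follows. This is not the code from the video: `dismissPopup`, the `closeSelector` parameter, and the timeout value are illustrative assumptions. The two small interfaces only mirror the `locator`, `waitFor`, and `click` methods that Playwright's `Page` and `Locator` provide, so a real `page` should be usable here without importing Playwright in the sketch itself.

```typescript
// Minimal structural types (assumption: they mirror the parts of
// Playwright's Page and Locator used below, nothing more).
interface LocatorLike {
  waitFor(options?: { state?: 'visible'; timeout?: number }): Promise<void>;
  click(): Promise<void>;
}

interface PageLike {
  locator(selector: string): LocatorLike;
}

// Waits for a pop-up's close button to become visible, then clicks it.
// Returns true if the pop-up was closed, false if waiting failed
// (for example, because the pop-up never appeared within the timeout).
// `closeSelector` is hypothetical and must be adapted to the real page.
async function dismissPopup(
  page: PageLike,
  closeSelector: string,
  timeoutMs = 5000,
): Promise<boolean> {
  const closeButton = page.locator(closeSelector);
  try {
    await closeButton.waitFor({ state: 'visible', timeout: timeoutMs });
  } catch {
    // Any failure while waiting is treated as "no pop-up to close".
    return false;
  }
  await closeButton.click();
  return true;
}
```

Returning a boolean instead of throwing lets a script continue whether or not the pop-up shows up on a given run, which is one possible design choice for pop-ups that appear only sometimes.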

The video mainly focuses on converting an image into text. The content creator notes that Playwright offers no direct way to extract text from an image, so they use the Tesseract.js library for the conversion and provide a simple code snippet that uses it to extract the text.
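A minimal sketch of that screenshot-then-OCR flow is shown below, under stated assumptions rather than as the video's actual snippet. The OCR step is passed in as a function so the sketch does not depend on a particular library version; `readNumberFromImage`, `extractDigits`, and the digits-only clean-up are hypothetical names and choices. The `OcrResult` shape assumes the recognized text is exposed at `data.text`, which is how Tesseract.js reports it.

```typescript
// Assumed result shape: the recognized text lives at `data.text`.
interface OcrResult {
  data: { text: string };
}

// Keeps only the digits from OCR output, dropping spaces, dashes,
// newlines, and any other non-digit characters.
function extractDigits(ocrText: string): string {
  return ocrText.replace(/\D/g, '');
}

// Takes a screenshot of an element, runs OCR on the image, and returns
// the digits found in the recognized text.
// `element` is anything with a screenshot() method (a Playwright locator
// has one); `recognize` is whatever OCR function the caller supplies.
async function readNumberFromImage<T>(
  element: { screenshot(): Promise<T> },
  recognize: (image: T) => Promise<OcrResult>,
): Promise<string> {
  const image = await element.screenshot();
  const { data } = await recognize(image);
  return extractDigits(data.text);
}
```

With Tesseract.js installed, `recognize` could be something like `(image) => Tesseract.recognize(image, 'eng')`, and `element` a Playwright locator pointing at the rendered phone number. Both wirings are assumptions about how the pieces fit together, and OCR output may still need manual checking.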

The content creator also explains that there are alternative methods for extracting text from pseudo-elements using CSS, but notes that these methods are unreliable because the page structure and class names can change.
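To illustrate why such approaches are brittle, here is one assumed shape they might take: a lookup table from icon class names to digits. The video summary does not show this code, and every class name in `DIGIT_BY_CLASS` is invented for illustration, not taken from any real site.

```typescript
// Hypothetical lookup table: these class names are made up and would
// have to be rebuilt by hand whenever the site changes its markup.
const DIGIT_BY_CLASS: Record<string, string> = {
  'icon-zero': '0',
  'icon-one': '1',
  'icon-two': '2',
  'icon-three': '3',
  'icon-four': '4',
  'icon-five': '5',
  'icon-six': '6',
  'icon-seven': '7',
  'icon-eight': '8',
  'icon-nine': '9',
};

// Translates a sequence of icon class names into a digit string.
// Throws on an unknown class name, which is exactly what happens
// once the site renames its classes.
function decodeDigits(
  classNames: string[],
  table: Record<string, string>,
): string {
  return classNames
    .map((name) => {
      const digit = table[name];
      if (digit === undefined) {
        throw new Error(`Unknown icon class: ${name}`);
      }
      return digit;
    })
    .join('');
}
```

Failing loudly on an unknown class is a deliberate choice in this sketch: a silently skipped digit would yield a wrong phone number, which is harder to notice than an error.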

Overall, the video introduces converting images into text with Tesseract.js and highlights the challenges of extracting text from pseudo-elements. The content creator demonstrates the process and provides code examples.

Related Videos

InnerHTML vs InnerText vs TextContent | Playwright Tutorial - Part 48
Element Handle VS Locator API | Playwright Tutorial - Part 47
Playwright Version 1.14 | Playwright Tutorial - Part 45
First Script - Auto Waits | Playwright - Part 3