Blog Ask AI Tools Videos QA Wiki Discord

How to download PDF using playwright?

I have a page on which there is a download link. In my test i want to click on the link and download the file. I am running the test on chrome and whenever i click on the link using playwright instead of download the pdf document is displayed in chromium’s internal pdf viewer. I looked at multiple similar issues mentioned on playwright git repo but none of them seems to be working. Has anyone implemented the pdf download step in any of the test cases?

This thread is trying to answer question "How can I download a PDF using Playwright when the PDF opens in Chromium's internal viewer instead of downloading?"

14 replies

cporter97August 7 1:39 PM

export default defineConfig({ use: { acceptDownloads: false, }, });

cporter97August 7 1:39 PM

Have you configured the browser to acceptDownloads?

vipinphogatAugust 7 1:40 PM

Yes… i did it in config file as mentioned here

https://playwright.dev/docs/api/class-testoptions#test-options-accept-downloads

cporter97August 7 1:41 PM

The default is true so shouldn't be the issue actually

cporter97August 7 1:42 PM

do you have a listener set up for the download?

vipinphogatAugust 7 1:43 PM

Right… i also tried with page.waitForEvent(‘download’) but still the pdf is opening instead of downloading. It seems the listener works only for non pdf docs

cporter97August 7 1:44 PM

Could you use page.pdf() to generate a pdf of the page once it opens in a new tab?

cporter97August 7 1:44 PM

https://playwright.dev/docs/api/class-page#page-pdf

cporter97August 7 1:44 PM

I know it isn't the greatest solution

vipinphogatAugust 7 1:47 PM

This will generate the pdf of current page. I want to download the pdf after i click on the link to which it is attached

cporter97August 7 1:50 PM

Right but if it generates "chromium’s internal pdf viewer." then surely this would generate the pdf you want.

Not suggesting this as the final solution but a potential work around

cporter97August 7 1:54 PM

You could do something similar to this -

Get the href from the "download pdf" button

href = page.locator("a.download-pdf").get_attribute('href')
absolute_url = f"https://arxiv.org{href}"

# Download the file using requests
file = requests.get(absolute_url)
with open('output.pdf', 'wb') as f:
    f.write(file.content)

cporter97August 7 1:55 PM

https://www.scrapingbee.com/webscraping-questions/playwright/how-to-download-file-with-playwright/#:~:text=You%20can%20download%20a%20file,download%20the%20file%20using%20requests%20.

vipinphogatAugust 7 1:59 PM

I will try this and get back with results… thanks for quick reply

Open in Discord

Related Discord Threads

Blog Ask AI Tools Videos QA Wiki Discord

About Questions Discord Forum Browser Extension Tags QA Jobs

Rayrun is a community for QA engineers. I am constantly looking for new ways to add value to people learning Playwright and other browser automation frameworks. If you have feedback, email [email protected].