Puppeteer can't open the URL in a simple script


I have this code:
const puppeteer = require("puppeteer");

(async () => {
  const browser = await puppeteer.launch({
    headless: false,
    ignoreDefaultArgs: ["--disable-extensions"],
  });
  const page = await browser.newPage();
  await page.goto("https://google.com");
  await browser.close();
})();
When I run it, the Chromium browser opens, but the URL does not open.
Thank you!

I downgraded my Puppeteer version to 18.1.0 and ran my code again, and it worked as I expected. Thank you!

Related

How can I enable Chromium Extensions?

I am trying to open Chromium with extensions, but I cannot figure out how to do this. When Chromium opens, there are no extensions installed.
I tried launching with --enable-remote-extensions and with --load-extension=, and I tried dragging and dropping the .crx file into the Chromium extensions panel, but nothing worked.
I got the errors "An error has occurred: Installation is not enabled" and "Package is invalid: 'CRX_REQUIRED_PROOF_MISSING'".
Could you help me with a working example?
Thanks!

After lots of trial and error, I've solved this issue.
Below is the working code and I hope it will help someone else.
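const puppeteer = require('puppeteer');

// Path to the extension's version folder inside the Chrome profile.
// The trailing semicolon matters: without it, the next line is parsed as a call on this string.
const extentionPath = "C:\\Users\\<YOUR_USERNAME>\\AppData\\Local\\Google\\Chrome\\User Data\\Profile 1\\Extensions\\<LONG_STRING_EXTENTION_ID>\\<EXTENTION_VERSION>";

(async () => {
  const customArgs = [
    `--start-maximized`,
    `--load-extension=${extentionPath}`
  ];
  const browser = await puppeteer.launch({
    defaultViewport: null,
    headless: false,
    // Stop Puppeteer from passing the flags that disable extensions.
    ignoreDefaultArgs: ["--disable-extensions", "--enable-automation"],
    args: customArgs,
  });
  const page = await browser.newPage();
  // goto already waits for the page to load, so no extra waitForNavigation is needed
  // (a second wait would only time out, because no further navigation happens).
  await page.goto(`https://google.com/`);
  await page.close();
  await browser.close();
})();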
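
The launch options from this answer can be factored into a small helper. This is only a sketch: buildExtensionLaunchOptions is a made-up name, not part of Puppeteer, and the --disable-extensions-except flag is an addition of mine that assumes you want only the one extension active. The path is whatever unpacked extension folder you pass in.

```javascript
// Hypothetical helper (not a Puppeteer API): builds launch options for
// loading a single unpacked extension from extensionPath.
function buildExtensionLaunchOptions(extensionPath) {
  if (!extensionPath) {
    throw new Error("extensionPath is required");
  }
  return {
    headless: false, // mirrors the answer above, which runs with a visible window
    defaultViewport: null,
    // Keep Puppeteer from passing its default --disable-extensions flag.
    ignoreDefaultArgs: ["--disable-extensions"],
    args: [
      // Assumption: only this extension should be enabled.
      `--disable-extensions-except=${extensionPath}`,
      `--load-extension=${extensionPath}`,
    ],
  };
}

// Usage (assumes puppeteer is installed):
//   const puppeteer = require("puppeteer");
//   const browser = await puppeteer.launch(
//     buildExtensionLaunchOptions("/path/to/unpacked-extension")
//   );
```

Keeping the options in one function makes it easier to see which flags are overridden and to reuse them across scripts.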

Puppeteer: Failed to launch the browser process! spawn

When I try to run node app.js, I get this error:
the message is Failed to launch the browser process! spawn
/Users/iliebogdanbarbulescu/Downloads/firstProject/node_modules/chromium/lib/chromium/chrome-mac/Chromium.app
EACCES
What I did
I checked the folder at /Users/iliebogdanbarbulescu/Downloads/firstProject/node_modules/chromium/lib/chromium/chrome-mac/Chromium.app and the file is not zipped. It can be run.
Note:
If I launch without executablePath, it works:
const browser = await puppeteer.launch({headless: false});
But I would like to use either Chrome or Chromium to open a new page.
Full code:
const express = require('express');
const puppeteer = require('puppeteer');
const app = express();

(async () => {
  const browser = await puppeteer.launch({
    headless: false,
    executablePath: '/Users/iliebogdanbarbulescu/Downloads/firstProject/node_modules/chromium/lib/chromium/chrome-mac/Chromium.app'
  });
  const page = await browser.newPage();
  await page.goto('https://google.com', {waitUntil: 'networkidle2'});
})().catch((error) => {
  console.error("the message is " + error.message);
});

app.listen(3000, function () {
  console.log('server started');
});

If you navigate to chrome://version/ in that exact browser, the page shows the Executable Path, which is the exact string you need to pass as the executablePath launch option.
On macOS, Chrome's path usually looks like this:
/Applications/Google Chrome.app/Contents/MacOS/Google Chrome
Or like this if Chromium is located in your node_modules folder:
/Users/iliebogdanbarbulescu/Downloads/firstProject/node_modules/chromium/lib/chromium/chrome-mac/Chromium.app/Contents/MacOS/Chromium
Compare that with the string you used for executablePath: yours stops at Chromium.app, which is the application bundle (a directory), not the binary inside it. Append /Contents/MacOS/Chromium to the end of your current path to make it work.
Note: the Chromium bundled with Puppeteer is the version guaranteed to work with that Puppeteer release. If you plan to use another Chrome or Chromium-based browser, you might run into unexpected issues.
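
To illustrate the path fix, here is a rough sketch of a helper that turns a .app bundle path into the path of the binary inside it. macExecutableFromBundle is a hypothetical name, and it assumes the binary is named after the bundle, which is true for the two paths shown in this answer but is not guaranteed for every app. When in doubt, copy the Executable Path from chrome://version/ instead.

```javascript
const path = require("path");

// Hypothetical helper: maps "/some/dir/Name.app" to
// "/some/dir/Name.app/Contents/MacOS/Name".
// Assumption: the binary inside the bundle has the same name as the bundle.
function macExecutableFromBundle(bundlePath) {
  const p = path.posix; // macOS paths use forward slashes
  if (p.extname(bundlePath) !== ".app") {
    // Not a bundle path (for example, it already points at the binary).
    return bundlePath;
  }
  const binaryName = p.basename(bundlePath, ".app");
  return p.join(bundlePath, "Contents", "MacOS", binaryName);
}

console.log(macExecutableFromBundle("/Applications/Google Chrome.app"));
// prints: /Applications/Google Chrome.app/Contents/MacOS/Google Chrome
```

The result is what you would pass as executablePath to puppeteer.launch.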

Following up on @theDavidBarton's answer:
The Chromium that shipped with Puppeteer did not work for me, but the Chrome installation on my MacBook did.
OS: macOS 10.15.7 (Catalina)
Node version: v14.5.0
Failed code:
const browser = await puppeteer.launch({
  headless: true,
  executablePath: "/users/bert/Project/NodeJS/PuppeteerTest/node_modules/puppeteer/.local-chromium/mac-818858/chrome-mac/Chromium.app/Contents/MacOS/Chromium"
});
Successful code:
const browser = await puppeteer.launch({
  headless: true,
  executablePath: "/Applications/Google Chrome.app/Contents/MacOS/Google Chrome"
});
Full code (just the first example from the Puppeteer website, with executablePath added):
const puppeteer = require('puppeteer');

(async () => {
  try {
    const browser = await puppeteer.launch({headless: true, executablePath: "/Applications/Google Chrome.app/Contents/MacOS/Google Chrome"});
    const page = await browser.newPage();
    await page.goto('https://example.com');
    await page.screenshot({path: 'example.png'});
    await browser.close();
  } catch (err) {
    console.log(err);
  }
})();
And yes, I got the screenshot! :-)

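Using locate-chrome: https://www.npmjs.com/package/locate-chrome
const puppeteer = require('puppeteer');
const locateChrome = require('locate-chrome');
(async () => {
  // locateChrome calls back with the path of the installed Chrome binary.
  const executablePath = await new Promise(resolve => locateChrome(arg => resolve(arg)));
  const browser = await puppeteer.launch({ executablePath });
})();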
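
If you would rather not add a dependency, a rough alternative is to check a list of candidate paths yourself. This is a sketch, not a replacement for locate-chrome: firstExecutable is a hypothetical helper, and the candidate list is an assumption (the Chrome path comes from the answers above, the Chromium path is a guess at a typical install location). It also rejects directories, which catches the Chromium.app mistake from the question before Puppeteer fails with EACCES.

```javascript
const fs = require("fs");

// Hypothetical helper: returns the first candidate that is a regular file
// and executable by the current user, or null if none qualifies.
function firstExecutable(candidates) {
  for (const candidate of candidates) {
    try {
      if (!fs.statSync(candidate).isFile()) {
        continue; // e.g. a .app bundle, which is a directory
      }
      fs.accessSync(candidate, fs.constants.X_OK);
      return candidate;
    } catch {
      // Missing or not executable: try the next candidate.
    }
  }
  return null;
}

// Assumed candidate locations on macOS; adjust for your machine.
const candidates = [
  "/Applications/Google Chrome.app/Contents/MacOS/Google Chrome",
  "/Applications/Chromium.app/Contents/MacOS/Chromium",
];
console.log(firstExecutable(candidates));
```

If the result is null, you can fall back to launching without executablePath so Puppeteer uses its bundled browser.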

How to catch a download with playwright?

I'm trying to download a file from a website using Playwright. The button that triggers the download does some js and then the download starts.
Clicking the button using the .click function triggers a download but it shows an error: Failed - Download error.
I've tried using the devtools protocol Page.setDownloadBehavior, but this doesn't seem to do anything.
const playwright = require("playwright");
const { /*chromium,*/ devices } = require("playwright");
const iPhone = devices["iPad (gen 7) landscape"];

(async () => {
  const my_chromium = playwright["chromium"];
  const browser = await my_chromium.launch({ headless: false });
  const context = await browser.newContext({
    viewport: iPhone.viewport,
    userAgent: iPhone.userAgent
  });
  const page = await context.newPage();
  const client = await browser.pageTarget(page).createCDPSession();
  console.log(client);
  await client.send("Page.setDownloadBehavior", {
    behavior: "allow",
    downloadPath: "C:/in"
  });
  //...and so on
  await page.click("#download-button");
  browser.close();
})();
There is a proposal for a better download API in Playwright, but I can't find the current API.
There was a suggestion that the downloadWillBegin event would be useful, but I have no idea how to access it from Playwright.
I'm open to the suggestion that I should use Puppeteer instead, but I moved to Playwright because I couldn't work out how to download a file with Puppeteer either, and the issue related to it suggested that the whole team had moved to Playwright.

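Take a look at the page.on("download") event:
const playwright = require("playwright");
(async () => {
  const password = "<YOUR_PASSWORD>"; // placeholder, not part of the original snippet
  const browser = await playwright.chromium.launch({});
  const context = await browser.newContext({ acceptDownloads: true });
  const page = await context.newPage();
  await page.goto("https://somedownloadpage.weburl");
  await page.type("#password", password);
  // Start waiting for the download before clicking, so the event is not missed.
  const [download] = await Promise.all([
    page.waitForEvent("download"),
    page.click("text=Continue"),
  ]);
  console.log("file downloaded to", await download.path());
  await browser.close();
})();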

Embarrassingly, I was closing the browser before the download had started.
It turns out that the download error was caused by the CDP client section of my code (the Page.setDownloadBehavior call). However, removing it means that I have no control over where the file is saved.
The download works with headless: false but not with headless: true.
If anyone has a better answer, that'd be great!
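
On controlling where the file ends up: Playwright's Download object has saveAs() and suggestedFilename(), so one possible approach is to copy the file to a folder of your choice once the download event fires. The following is a sketch under assumptions: clickAndSaveDownload is a hypothetical helper, and page is expected to come from a context created with acceptDownloads: true. The selector and target folder are whatever you pass in.

```javascript
const path = require("path");

// Hypothetical helper: clicks the element that triggers a download and
// saves the file under targetDir, using the file name the site suggests.
async function clickAndSaveDownload(page, selector, targetDir) {
  // Register the wait before clicking so the download event cannot be missed.
  const [download] = await Promise.all([
    page.waitForEvent("download"),
    page.click(selector),
  ]);
  const target = path.join(targetDir, download.suggestedFilename());
  // saveAs resolves only after the download has finished and been copied.
  await download.saveAs(target);
  return target;
}

// Usage inside an async function (assumes a Playwright page is available):
//   const savedTo = await clickAndSaveDownload(page, "#download-button", "C:/in");
//   await browser.close(); // close only after the file has been saved
```

Because the helper awaits saveAs, closing the browser afterwards no longer races with the download.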

You can use waitForTimeout.
I tried it with {headless: true} and await page.waitForTimeout(1000);
and it works fine.

To download a file (or get its contents as a buffer), I highly recommend the got Node module. It's much easier, cleaner, and lighter.
// require() works with got v11 and earlier; newer releases are ESM-only and need import.
const got = require('got');
(async () => {
  const response = await got('https://sindresorhus.com')
    .on('downloadProgress', progress => {
      // Report download progress
    })
    .on('uploadProgress', progress => {
      // Report upload progress
    });
  console.log(response);
})();

How to fix 'page.target is not a function' in puppeteer?

I'm trying to use the Devtools Protocol with Puppeteer, but it throws the following error:
TypeError: page.target is not a function
This is my code:
const puppeteer = require('puppeteer');

(async() => {
  // Use Puppeteer to launch a browser and open a page.
  const browser = await puppeteer.launch({headless: true});
  const page = await browser.newPage();
  // Create a raw DevTools protocol session to talk to the page.
  const session = await page.target().createCDPSession();
  await page.goto('https://www.google.com');
})();
Am I missing something?

Make sure you're using the latest version of Puppeteer.
"dependencies": {
"puppeteer": "latest"
}
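
If you cannot upgrade right away, or want a clearer error than a TypeError, you could feature-detect the method before calling it. This is only a sketch: openCdpSession is a hypothetical helper, and which Puppeteer versions expose which method is not stated in this thread, so it simply checks what exists on the page object at runtime.

```javascript
// Hypothetical helper: opens a DevTools protocol session using whichever
// method the installed Puppeteer version exposes on the page object.
async function openCdpSession(page) {
  if (typeof page.createCDPSession === "function") {
    // Recent Puppeteer releases expose this directly on Page.
    return page.createCDPSession();
  }
  if (typeof page.target === "function") {
    // The form used in the question.
    return page.target().createCDPSession();
  }
  throw new Error(
    "This Puppeteer version exposes no CDP session method on Page; try upgrading."
  );
}

// Usage inside an async function (assumes puppeteer is installed):
//   const page = await browser.newPage();
//   const session = await openCdpSession(page);
```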

Run Puppeteer with Tor

I installed the Tor Expert Bundle and I would like to run it with Puppeteer.
I tried:
const browser = await puppeteer.launch({headless: false,args:['--proxy-server="socks5://127.0.0.1:9050"']});
But I get the error ERR_NO_SUPPORTED_PROXIES. I can run it with a normal Chrome browser.

There's an open bug in Chromium regarding more complex proxy configurations in headless mode. There has not been any activity on it since July 2017.
However, I've been able to run Puppeteer (1.3.0) with headless Chromium and a SOCKS5 proxy configuration:
const browser = await puppeteer.launch({args: ['--proxy-server=socks5://127.0.0.1:1337']});
Try updating Puppeteer, which also updates the bundled Chromium, and run it again. It also looks like you have a typo: remove the double quotes around socks5://127.0.0.1:9050.
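
To avoid the quoting mistake, you could build the flag with a small function. A minimal sketch, assuming a SOCKS5 proxy; socks5ProxyArg is a made-up name. As far as I know, Puppeteer hands args to Chromium without going through a shell, so any quotes you add become part of the value instead of being stripped.

```javascript
// Hypothetical helper: builds the --proxy-server flag for a SOCKS5 proxy.
// Stray quotes in the host are removed so they cannot end up in the flag.
function socks5ProxyArg(host, port) {
  const cleanHost = String(host).replace(/["']/g, "");
  return `--proxy-server=socks5://${cleanHost}:${Number(port)}`;
}

console.log(socks5ProxyArg("127.0.0.1", 9050));
// prints: --proxy-server=socks5://127.0.0.1:9050

// Usage (assumes puppeteer is installed and Tor listens on port 9050):
//   const browser = await puppeteer.launch({ args: [socks5ProxyArg("127.0.0.1", 9050)] });
```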

It looks like Puppeteer can only be run with a proxy in non-headless mode.

Based on Running Puppeteer with Tor.
/**************************************************************************
 * IMPORTS
 ***************************************************************************/
const puppeteer = require('puppeteer')

/**************************************************************************
 * DEMOS > USING PUPPETEER BEHIND TOR
 * BASED ON https://medium.com/@jsilvax/running-puppeteer-with-tor-45cc449e5672
 ***************************************************************************/
;(async () => {
  const browser = await puppeteer.launch({
    args: ['--proxy-server=socks5://127.0.0.1:9050'],
    headless: false,
  })
  const page = await browser.newPage()
  await page.goto('https://check.torproject.org/')
  const isUsingTor = await page.$eval('body', (el) =>
    el.innerHTML.includes('Congratulations. This browser is configured to use Tor')
  )
  if (!isUsingTor) {
    console.log('Not using Tor. Closing...')
    return await browser.close()
  }
  console.log('Using Tor. Continuing... ')
  // Now you can go wherever you want
  await page.goto('https://www.facebook.com/')
  // You would add additional code to do stuff...
  // Then when you're done, just close
  await browser.close()
})()
