Page Replica – Tool for Web Scraping, Prerendering, and SEO Boost

“Page Replica” is a versatile web scraping and caching tool built with Node.js, Express, and Puppeteer. It helps with prerendering web app pages and allows for selective removal of JavaScript, caching the resulting HTML. One unique feature is its Nginx configuration that efficiently handles user and search engine bot traffic. To use the tool, you can clone the repository and install the dependencies. By making GET requests to the API, you can scrape individual web pages or sitemaps, and the prerendered copies will be stored in the cache folder. Additionally, the tool provides an Nginx configuration for serving the cached pages to search engine bots. Contributions and feature requests are welcome.

https://github.com/html5-ninja/page-replica

To top