Some image scraping silently fails #131

maximecb · 2024-11-10T16:54:36Z

First I'd like to say thank you for putting in the work to create this export script 🙏

I'd like to report an issue with the image export. The script seems to be able to scrape most of the images on my blog just fine, but some images seem to randomly not get scraped and I don't know why. Some of the images are wrapped in links while others are not.

Example in the exported output. I would have expected this image to get scraped and be replaced by a relative URL:

![](https://lh5.googleusercontent.com/t2kGCE-RAuoB1DvJl7oJxqEvAShWRjHb9_r1rgw8Q84ZuBivDJzbZUF-HLbxbIvVlN1gEHYQFVJmYdDpJbRmL167WvxhTbb0eUkquWsy0B2v85gi0IlT-kOCjPPO95iMXvdZRt1V)

I ran the export script with the following arguments:

node index.js \
--input=pointersgonewild.xml \
--output=exported \
--include-other-types=false \
--year-folders=false \
--month-folders=false \
--post-folders=true \
--prefix-date=true \
--save-attached-images=true \
--save-scraped-images=true

@lonekorean I would say that your WordPress export script is very nearly perfect. I can go and manually scrape the failed images myself, but it would be even better if the script could handle it.

The text was updated successfully, but these errors were encountered:

maximecb changed the title ~~Some image scraping fails~~ Some image scraping silently fails Nov 10, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Some image scraping silently fails #131

Some image scraping silently fails #131

maximecb commented Nov 10, 2024

Some image scraping silently fails #131

Some image scraping silently fails #131

Comments

maximecb commented Nov 10, 2024