Creating a PDF from Irfanview screenshots of a book with missing pages on Google Books

Discussion:

Creating a PDF from Irfanview screenshots of a book with missing pages on Google Books

(too old to reply)

Oliver

2024-03-13 05:37:21 UTC

I couldn't find an out-of-print book online no matter how hard I tried.
Google Books had it, of course, but it was missing pages.

So I did this procedure, which works, but the results are too blurry.
Is there a better way to snapshot a book in Archive.org displays?

1. Create an account on archive.org (they verify the email)
<https://archive.org/account/signup>
2. Log in to borrow the hard-to-find book (you get only 1 hour)
<https://archive.org/account/login>
3. Position the book in your web browser as big as you can get it
(Set it up as one page at a time so the right-arrow button works)
4. Start Windows Irfanview & press "c" (capture)
5. Set "Capture method" to "Hot key" to "Ctrl + Left" (or whatever)
6. Set (5) Custom rectangle/region capture
Click the top right corner (e.g., 765:223)
Click the bottom right corner (e.g., 1183:842)
7. Manually subtract 1183-765=418 width, 842-223=619 length
8. Back to Irfanview, press "c" & select #7
Choose (7) Fixed screen rectangle
X-pos=765 Y-pos=223
Width=418 Height=619
9. (x) Save captured images as file
File name: ###_$U(%d%m%Y_%H%M%S)
Save as: PDF (or whatever format you want)
10. In Archive.org, press the right arrow in the GUI to "next page"
For each page, press "Ctrl + Left" for Irfanview to snapshot it
(that will create a numbered PDF file for each page of the book)
11. Open Adobe Acrobat Writer "File > Create PDF > From Multiple Files"
Hit the Browse button and in Windows load each file using
the first file selected and then shift-select to the last file
(it helps if you add a dummy 000 first page PDF & 999 last page PDF)
(You can delete those first and last dummy PDF pages later)
(The reason is Windows, somehow, screws up the first file.)
12. Then shrink the resulting multi-page PDF in Adobe Acrobat 6
Acrobat: File > Reduce File Size

If you try to use Calibre on the original PDF, it does not
improve the quality no matter which of the score of formats
you convert to and save.

Unfortunately, the archive.org images are blurry.
They're readable. But definitely blurry.

Google Books is clearer, but Google Books is missing pages.

Does anyone know if you go to Google Books over a period of
a few days or from different IP addresses, if all the pages
eventually show up?

VanguardLH

2024-03-13 07:13:22 UTC

Post by Oliver
I couldn't find an out-of-print book online no matter how hard I tried.
Google Books had it, of course, but it was missing pages.

When you looked at the book at Google Books, is the ISBN number listed?
A search on that might prove more fruitful. Cannot do the search for
you since neither the book, author, or ISBN was mentioned.

Oliver

2024-03-13 09:00:26 UTC

Post by VanguardLH

Post by Oliver
I couldn't find an out-of-print book online no matter how hard I tried.
Google Books had it, of course, but it was missing pages.

When you looked at the book at Google Books, is the ISBN number listed?
A search on that might prove more fruitful. Cannot do the search for
you since neither the book, author, or ISBN was mentioned.

If I can't find it, you'll never find it. Nobody will if I can't.

I wouldn't have asked the question if the epub was available anyway.
To be doubly clear, I'm not asking people to run my searches for me.
Especially on a PDF and editors and Windows newsgroup. I can run them.

I am not asking people to run my searches, as I am writing scripts to run
those searches as you can see in the other thread you had responded to.
<https://www.novabbs.com/computers/article-flat.php?id=77704&group=alt.comp.os.windows-10#77704>

But to answer your question, of course the book has not only an ISBN but it
has about five of them depending on the paperback, hardcover & edition.

Back to the original question, I went to the Google Books from three
different VPN browsers and I answered part of my own question, which is
that there are DIFFERENT pages which are displayed by Google Books.

The main question is how to get a better quality screenshot to PDF.
The secondary question, already partially answered, is if I go to Google
Books enough times from enough different IP addresses and browser
fingerprints, will the ENTIRE book eventually be revealed?

Has anyone tried that?
If you have, you could save me a LOT of time experimenting with it.

Oliver

2024-03-13 19:14:13 UTC

Post by Oliver
The main question is how to get a better quality screenshot to PDF.

Show the Screen with as big resolution as possible (far bigger than
the PDF will show) and then take the Screenshot. Maybe use an 8k
Screen?

Thanks for that advice, since this is not a question of how to download a
book, but how to save something that is presented only on screen in about
two hundred instances.

I have a working solution so what I'm seeking from experts is a better
working solution - specifically with higher resolution than the solution.

I did change my monitor from the recommended to other resolutions, but none
were as good as the recommended resolution (1920x1080) but that could be
expected since it was the highest resolution offered by Win+I > Display.

Then you have the maximum screenshot you can make.

I was hoping to "sharpen" the image where I was hoping there is a PDF tool
out there which does that - much like Irfanview sharpens a JPEG snapshot.

If you want more, then it is not a screenshot any more, because it
can't be from your screen.

The question is all about finding a PDF screenshotter (c.t.p) that can be
edited to be sharper (c.e) on the Windows platform (a.c.o.w-10).

The situation, summarized is:
a. How do you obtain a sharper screenshot than your resolution (windows).
b. Once you have the screenshots, how do you assemble 200 of them (PDF).
c. And is there an editor out there that sharpens the images (editors).

Otherwise, my approach (which is essentially the first approach that I took
to the problem of assembling & sharpening 200 screenshots) is the best.

But I asked the question here in the hopes to find a better solution.

Frank Slootweg

2024-03-13 19:52:58 UTC

Oliver <***@invalid.net> wrote:
[...]

Post by Oliver
I was hoping to "sharpen" the image where I was hoping there is a PDF tool
out there which does that - much like Irfanview sharpens a JPEG snapshot.

Why do you want/need to do the sharpening in "a PDF tool"? Why can't
you just sharpen the screenshots?

[...]

Post by Oliver
a. How do you obtain a sharper screenshot than your resolution (windows).

See above.

BTW, have you considered using Windows Snipping Tool (instead of
IrfanView) to make the screenshots? (I don't know which of the two is
more suitable for your task.)

Post by Oliver
b. Once you have the screenshots, how do you assemble 200 of them (PDF).

As knuttle mentioned, IrfanView can make a multipage PDF from images,
not need for other tools (like Adobe Acrobat Writer which you mentioned).

Post by Oliver
c. And is there an editor out there that sharpens the images (editors).

See above. As to sharpening images, you may also want to look at the
free software of the major camera brands. I used Nikon's software.

Post by Oliver
Otherwise, my approach (which is essentially the first approach that I took
to the problem of assembling & sharpening 200 screenshots) is the best.
But I asked the question here in the hopes to find a better solution.

Oliver

2024-03-13 21:48:41 UTC

Post by Frank Slootweg

Post by Oliver
I was hoping to "sharpen" the image where I was hoping there is a PDF tool
out there which does that - much like Irfanview sharpens a JPEG snapshot.

Why do you want/need to do the sharpening in "a PDF tool"? Why can't
you just sharpen the screenshots?

I already tried that by snapping JPEG images in Irfanview and sharpening
them and then converting them to PDF (as the final thing is a book).

Your suggestion gave me an idea though, which is I can probably snapshot to
GIF or BMP which can be set to not have image compression.

Since it's a book of hundreds of pages, it might be better to snapshot as
TIFF without compression and then combine all the TIFFs.

I just ran a quick test in Irfanview and I was pleased to see the Irfanview
sharpen command works on all three non-compression file formats.

I wonder if the PDF experts know how to snapshot a screen to PDF at no
compression? Is that even possible?

Post by Frank Slootweg

Post by Oliver
a. How do you obtain a sharper screenshot than your resolution (windows).

BTW, have you considered using Windows Snipping Tool (instead of
IrfanView) to make the screenshots? (I don't know which of the two is
more suitable for your task.)

I know about the Snipping Tool as it was renamed over the years but I have
never used it except in the beginning long ago, and found Irfanview easier.

But maybe it has the ability to save PDFs without compression? Let me look.
https://www.google.com/search?q=windows+snipping+tool+save+to+pdf+uncompressed
a. Windowskey+shift+s
b. Draw the area with the mouse
c. Press control+p (which can only save as PDF, not any other format)

Oh my Gosh. I didn't know it would do what it did do. I happened to have a
Firefox session going which was a long page (off the screen) and the
snipping print above actually printed the ENTIRE page (which would have not
been captured had it been an Irfanview print of what's displaying on
screen!).

What's nice is the Windows snipping tool saved parts of the page not
displaying, so I'm going to need to see if it can display the entire book
when it's scrolling downward (not side to side) for hundreds of pages.

That might be the trick needed!

Post by Frank Slootweg

Post by Oliver
b. Once you have the screenshots, how do you assemble 200 of them (PDF).

As knuttle mentioned, IrfanView can make a multipage PDF from images,
not need for other tools (like Adobe Acrobat Writer which you mentioned).

I wasn't aware Irfanview can make a multipage PDF from multiple images.
I knew the "b" batch command could "convert" many images to just as many
PDFs but not that it would make a multi-page PDF from the images.

Let me google that for myself.
https://www.google.com/search?q=irfanview+make+multi-page+pdf+from+jpegs

You're right again!
https://irfanview-forum.de/forum/program/support/9258-

Irfanview: Options > Multipage images > Create multipage PDF
Then "Add Images" and then "Save" with PDF compression turned off!

Now that is nice! It eliminates the need for the Adobe Acrobat writer!
And it solves the issue of saving the images to PDF without compression.

I need to experiment with that, where my first thought is.
a. Save everything to an uncompressed format (GIF, BMP or TIFF).
b. Run Irfanview batch (b) to sharpen (which doesn't always make it better)
c. Run Irfanview to convert the results to an uncompressed PDF.

That solves a lot of the data collection problems.
I'm not sure if it solves the blur problem yet though.

Post by Frank Slootweg

Post by Oliver
c. And is there an editor out there that sharpens the images (editors).

See above. As to sharpening images, you may also want to look at the
free software of the major camera brands. I used Nikon's software.

I need to find a way to intelligently sharpen specific images which are
blurry words so it's always black on white (which is the problem set).

I wonder if they have software specifically for de-blurring text?

Carlos E.R.

2024-03-18 14:23:38 UTC

Post by Oliver

Post by Oliver
I was hoping to "sharpen" the image where I was hoping there is a PDF tool
out there which does that - much like Irfanview sharpens a JPEG snapshot.

Why do you want/need to do the sharpening in "a PDF tool"? Why can't
you just sharpen the screenshots?

I already tried that by snapping JPEG images in Irfanview and sharpening
them and then converting them to PDF (as the final thing is a book).
Your suggestion gave me an idea though, which is I can probably snapshot to
GIF or BMP which can be set to not have image compression.
Since it's a book of hundreds of pages, it might be better to snapshot as
TIFF without compression and then combine all the TIFFs.
I just ran a quick test in Irfanview and I was pleased to see the Irfanview
sharpen command works on all three non-compression file formats.
I wonder if the PDF experts know how to snapshot a screen to PDF at no
compression? Is that even possible?

Why no compression? Compression is not the same as lossy.

--
Cheers, Carlos.

Paul

2024-03-14 05:14:08 UTC

Post by Frank Slootweg
[...]

Post by Oliver
I was hoping to "sharpen" the image where I was hoping there is a PDF tool
out there which does that - much like Irfanview sharpens a JPEG snapshot.

Why do you want/need to do the sharpening in "a PDF tool"? Why can't
you just sharpen the screenshots?
[...]

Post by Oliver
a. How do you obtain a sharper screenshot than your resolution (windows).

See above.
BTW, have you considered using Windows Snipping Tool (instead of
IrfanView) to make the screenshots? (I don't know which of the two is
more suitable for your task.)

Post by Oliver
b. Once you have the screenshots, how do you assemble 200 of them (PDF).

As knuttle mentioned, IrfanView can make a multipage PDF from images,
not need for other tools (like Adobe Acrobat Writer which you mentioned).

Post by Oliver
c. And is there an editor out there that sharpens the images (editors).

See above. As to sharpening images, you may also want to look at the
free software of the major camera brands. I used Nikon's software.

Post by Oliver
Otherwise, my approach (which is essentially the first approach that I took
to the problem of assembling & sharpening 200 screenshots) is the best.
But I asked the question here in the hopes to find a better solution.

Snippingtool used to be a good tool.

It's had its ups and downs. It has been fiddled
by morons, and *broken* more than once.

The other day, it was taking a snapshot, then
something would segfault or otherwise blow out,
and *no* window would appear with my snapshot in it.

When will these morons learn that you "work on software
until it is perfect" and "then STOP FUCKING WITH IT!!!!!!!!!!!!!!!".

Oh, well.

As a consequence, I cannot really saddle people with
a copy of Snippingtool, for as long as its
day to day disposition (the blowout last week)
are still happening. Why have an unreliable SaaS thing,
when you can have a reliable/unchanging ancient
piece of software ?

Right now, I rely upon GIMP screenshot capability,
as my "known-to-work" solution. It's more steps, but
when your SnippingTool is doing something erratic,
you take the high road.

Paul

VanguardLH

2024-03-13 20:59:18 UTC

Post by Oliver

Post by VanguardLH

Post by Oliver
I couldn't find an out-of-print book online no matter how hard I tried.
Google Books had it, of course, but it was missing pages.

When you looked at the book at Google Books, is the ISBN number listed?
A search on that might prove more fruitful. Cannot do the search for
you since neither the book, author, or ISBN was mentioned.

If I can't find it, you'll never find it. Nobody will if I can't.
I wouldn't have asked the question if ...

We don't know all the ifs of how you searched. Geez, lighten up.
Someone tries to help, and all you can do is lambaste the respondent
rather than say "Yes, I already tried searching on the ISBN."

You're a teacher?

Post by Oliver
I am not asking people to run my searches, as I am writing scripts to run
those searches as you can see in the other thread you had responded to.
<https://www.novabbs.com/computers/article-flat.php?id=77704&group=alt.comp.os.windows-10#77704>

Yeah, you're creating a Rube Goldberg solution over there while casting
out all your students on non-Windows platforms.

Post by Oliver
The main question is how to get a better quality screenshot to PDF.

https://www.google.com/googlebooks/perspectives/facts.html

Question: Why do some in-copyright books have full pages visible?
Answer: Whenever you can see more than a few snippets of an in-copyright
book in Google Books, it's because the author or publisher has joined
our Partner Program and granted us permission to show you the Sample
Pages View, which helps you learn enough about a book to know whether
you want to buy it. This is something we do with a publisher's explicit
permission.

Question: Can I download books for free using Google Books?
Answer: Google Books helps you search within and discover books. When
you find a book that's still under copyright, you'll typically see only
a small portion of the book at a time – either the Snippet View or the
Sample Pages View – plus links to places where you can buy or borrow it.
Some publishers have set their in-copyright books to Full Book View. If
you find a book that's out of copyright, we're also able to display the
Full Book View.

Is the copyright still valid on whatever is this unidentified book you
want to find? If so, your "students" probably appreciate your theft.
That a book is out of print does not excuse you from a copyright.
Copyright lasts for the life of the author plus an additional 70 years
if published after Jan-1-1978, or 95 years if published earlier. Must
be a list of very old books. No one deciding to print a book does not
nullify its copyright.

Apparently your intent is to violate copyright. When Google shows just
snippets of a book, that complies with copyright law. When they show
the full book, that's with permission, or the copyright lapsed. You
want to distribute books regardless of active copyrights.

Oliver

2024-03-13 22:02:18 UTC

Post by VanguardLH
We don't know all the ifs of how you searched. Geez, lighten up.
Someone tries to help, and all you can do is lambaste the respondent
rather than say "Yes, I already tried searching on the ISBN."

Well, you're a better software user than I am if you can find the
downloadable PDF for this particular book which can be bought but which
isn't available in any free PDF other than via the Google Books method
(missing pages) or archive.org (full book but you can only borrow it for
one hour at a time).

A legal history of the civil war and reconstruction A nation of rights
By Laura F. Edwards, 2015, 212pgs, ISBN 1107008794 & 978-1107008793

I'm NOT asking you to run a search for me, so I'll run it myself on
that particular item, where I'm well aware of what exists and what does not
exist in terms of book sites as explained in the related thread here.
<https://www.novabbs.com/computers/article-flat.php?id=77704&group=alt.comp.os.windows-10#77704>

But I'm now answering your question of finding the PDF online for free.

It's available via multiple library logins of course, but the quest would
be for a book that is NOT available on the net other than in archive.org.
1. https://stars.library.ucf.edu/etextbooks/222/
2. https://academic.oup.com/ajlh/article-abstract/56/2/299/2195548
3. https://www.proquest.com/docview/1788738404?sourcetype=Scholarly%20Journals
4. https://academic.oup.com/ahr/article-abstract/121/2/572/2582064
5. https://www.cambridge.org/core/books/abs/legal-history-of-the-civil-war-and-reconstruction/index/AE32B6F1403F78147EF3F5E03DFF0BE1
(and so on)

Again, I'm not going to ask you to google that for me, as I can
run all the searches you can run, so that's not what I need.
1. https://archive.org/details/legalhistoryofci0000edwa (one hour free borrow only)
2. https://openlibrary.org/search?q=A+legal+history+of+the+civil+war+and+reconstruction+A+nation+of+rights&mode=everything
3. https://www.google.nl/books/edition/A_Legal_History_of_the_Civil_War_and_Rec/0vB4BgAAQBAJ
4. https://books.google.com/books/about/A_Legal_History_of_the_Civil_War_and_Rec.html?id=0vB4BgAAQBAJ
5. https://books.google.com/books?id=9asPBgAAQBAJ&source=gbs_book_other_versions
6. https://books.google.com/books?hl=en&lr=&id=9asPBgAAQBAJ&oi=fnd&pg=PR11&dq=A+legal+history+of+the+civil+war+and+reconstruction+A+nation+of+rights&ots=0uIAA7OpUU&sig=mPzfC4IDyLL8gzrpgSefWE8WqWo#v=onepage&q=A%20legal%20history%20of%20the%20civil%20war%20and%20reconstruction%20A%20nation%20of%20rights&f=false
7. https://www.readanybook.com/search?q=A%20legal%20history%20of%20the%20civil%20war%20and%20reconstruction%20A%20nation%20of%20rights
(and so on)

The question was never about running the search.

The question was about the technical problem of
a. Screenshotting hundreds of pages of a book
b. That only shows one page at a time
c. And which is kind of blurry so it needs sharpening
d. And then it needs reassembling
e. And likely a lot of compression as a result of individual files
each with embedded fonts

Some really good answers have come about from this question already.
Thanks for your help and advice.

VanguardLH

2024-03-14 00:00:02 UTC

Post by Oliver
ISBN 1107008794

First hit in my Google search:

https://www.google.com/search?q=ISBN+1107008794

pointed to:

https://www.amazon.com/Legal-History-Civil-War-Reconstruction/dp/1107008794

I also searched Google Shopping on the title, and it pointed to the same
Amazon page. The above Google search found it at eBay:

https://www.ebay.com/p/202604172

Hardcover is pricey at $90 to $95. Paperback is cheaper at $11 to $26.
At Amazon, the seller is amazon.com, not some 3rd party using Amazon as
a store frontend. A further search at eBay on the title found some
cheaper options:

https://www.ebay.com/sch/i.html?_from=R40&_nkw=A+legal+history+of+the+civil+war+and+reconstruction+A+nation+of+rights&_sacat=0&_odkw=ISBN+1107008794&_osacat=0

You'll get the book, not a .PDF file or URL pointing to a PDF of the
book. I clicked on the "See all formats and editions", but just paper
copies were shown; however, there a Kindle e-book mentioned, the
cheapest format, so maybe you could create a .pdf from that assuming
Kindle doesn't enforce some DRM on still-copyrighted works.

To copy [violate the copyright] the full content to others, you'll need
a scanner. When I use the software bundled with my scanner, and for
document mode, it will let me keep scanning to append all scanned pages
into one document, like a PDF. A search online shows there are many
EPUB to PDF converters. Calibre can do doc conversion, too.

https://manual.calibre-ebook.com/conversion.html

I haven't bothered with Amazon's Kindle. Maybe they use an e-pub format
that isn't proprietary. From the video, Calibre can convert from EPUB
or AZW3 to PDF.

https://en.wikipedia.org/wiki/Kindle_File_Format

Other places I found selling that book:
https://www.browseaboutbooks.com/book/9781107008793
https://www.vitalsource.com/products/a-legal-history-of-the-civil-war-and-laura-f-edwards-v9781316234044
https://www.libroworld.com/9781107008793/
https://ca.biblio.com/book/legal-history-civil-war-reconstruction-nation/d/1417002433

If these really are students, don't they have to buy their own textbooks
for a class? Second to tuition, class books were damn expensive when I
went to the university. I bought the used ones if I wasn't going to
keep the book (by selling it back to the bookstore provided later
classes used the same book). Calibre can run on multiple platforms, so
maybe your students could get the cheap Kindle format. Them buying the
book eliminates you violating the copyright. Since the publication date
is 2015, the copyright has a lot longer to expire (75 years) even if the
author died the moment of publication.

Now it's time to retract your claim of "If I can't find it, you'll never
find it. Nobody will if I can't."

knuttle

2024-03-14 00:05:55 UTC

Post by VanguardLH

Post by Oliver
ISBN 1107008794

https://www.google.com/search?q=ISBN+1107008794
https://www.amazon.com/Legal-History-Civil-War-Reconstruction/dp/1107008794
I also searched Google Shopping on the title, and it pointed to the same
https://www.ebay.com/p/202604172
Hardcover is pricey at $90 to $95. Paperback is cheaper at $11 to $26.
At Amazon, the seller is amazon.com, not some 3rd party using Amazon as
a store frontend. A further search at eBay on the title found some
https://www.ebay.com/sch/i.html?_from=R40&_nkw=A+legal+history+of+the+civil+war+and+reconstruction+A+nation+of+rights&_sacat=0&_odkw=ISBN+1107008794&_osacat=0
You'll get the book, not a .PDF file or URL pointing to a PDF of the
book. I clicked on the "See all formats and editions", but just paper
copies were shown; however, there a Kindle e-book mentioned, the
cheapest format, so maybe you could create a .pdf from that assuming
Kindle doesn't enforce some DRM on still-copyrighted works.
To copy [violate the copyright] the full content to others, you'll need
a scanner. When I use the software bundled with my scanner, and for
document mode, it will let me keep scanning to append all scanned pages
into one document, like a PDF. A search online shows there are many
EPUB to PDF converters. Calibre can do doc conversion, too.
https://manual.calibre-ebook.com/conversion.html
http://youtu.be/D7GqNMo5GM4
I haven't bothered with Amazon's Kindle. Maybe they use an e-pub format
that isn't proprietary. From the video, Calibre can convert from EPUB
or AZW3 to PDF.
https://en.wikipedia.org/wiki/Kindle_File_Format
https://www.browseaboutbooks.com/book/9781107008793
https://www.vitalsource.com/products/a-legal-history-of-the-civil-war-and-laura-f-edwards-v9781316234044
https://www.libroworld.com/9781107008793/
https://ca.biblio.com/book/legal-history-civil-war-reconstruction-nation/d/1417002433
If these really are students, don't they have to buy their own textbooks
for a class? Second to tuition, class books were damn expensive when I
went to the university. I bought the used ones if I wasn't going to
keep the book (by selling it back to the bookstore provided later
classes used the same book). Calibre can run on multiple platforms, so
maybe your students could get the cheap Kindle format. Them buying the
book eliminates you violating the copyright. Since the publication date
is 2015, the copyright has a lot longer to expire (75 years) even if the
author died the moment of publication.
Now it's time to retract your claim of "If I can't find it, you'll never
find it. Nobody will if I can't."

Please don't quote me but as I remember Kindle and other similar ebooks
are a proprietary version of the standard PDF file. that may be another
approach to get what you want.

VanguardLH

2024-03-14 00:59:44 UTC

The attribution line you omitted in your reply to me.

Post by knuttle

Post by VanguardLH
Now it's time to retract your claim of "If I can't find it, you'll never
find it. Nobody will if I can't."

Please don't quote me but as I remember Kindle and other similar ebooks
are a proprietary version of the standard PDF file. that may be another
approach to get what you want.

I didn't quote nor reply to you. I quoted Oliver's claim in his article
(Message-ID: <usrpva$sb4h$***@dont-email.me>), and I replied to Oliver
about retracting *his* claim.

Are you now Oliver?

In your other subthread (Message-ID: <uss7as$v364$***@dont-email.me>), my
filters hid your post along with any replies to it. As a consequence, I
did not read nor respond in that subthread. Anyone who posts here using
Base64 is not complying to netiquette in text-only newsgroups. Posting
in Base64 is a known defect with Thunderbird. I filter out Base64
posters.

Oliver

2024-03-14 01:32:53 UTC

Post by VanguardLH
Now it's time to retract your claim of "If I can't find it, you'll never
find it. Nobody will if I can't."

You did not find it.

I gave you an example of something that has no free downloadable PDF.
You didn't find a free downloadable PDF.

Neither did I. And that was why I gave you that example.
We didn't find it probably because it likely doesn't exist on the net.

If you can find the free PDF, then you're a better searcher than I am.
But I doubt you are any better than I am because I'm pretty good.

Anyway, the point of this question isn't the free downloadable PDF.
You made the question into the free downloadable PDF - not me.

The point is the software to MAKE that PDF out of displayed pages.
The trick is going to be to de-blur those pages.

I'm thinking there may be software someone alluded to that will take images
of books (which are black text on a white paper background) & de-blur them.

Do you know of any software that does that de-blurring specific for text?
(Sharpen is ubiquitous but it's not good enough in my tests yesterday.)

Because this thread is about improving the process of creating a PDF
when all you have are the visible pages (as from archive.org).

Carlos E.R.

2024-03-18 14:15:29 UTC

Post by Oliver

Post by VanguardLH
Now it's time to retract your claim of "If I can't find it, you'll never
find it. Nobody will if I can't."

You did not find it.
I gave you an example of something that has no free downloadable PDF.
You didn't find a free downloadable PDF.

Why should we help you to steal a book, and not pay for it?

Just buy the book.

--
Cheers, Carlos.

Julian Bradfield

2024-03-14 10:41:37 UTC

Post by Oliver
Well, you're a better software user than I am if you can find the
downloadable PDF for this particular book which can be bought but which

Why should this group help a pirate? It's not even an expensive
book. If you live in a developed economy, you've already spent far
more than the cost of the book in terms of your time trying to avoid
paying for it.

Post by Oliver
A legal history of the civil war and reconstruction A nation of rights
By Laura F. Edwards, 2015, 212pgs, ISBN 1107008794 & 978-1107008793

Newyana2

2024-03-14 11:39:15 UTC

"Julian Bradfield" <***@inf.ed.ac.uk> wrote

| Why should this group help a pirate? It's not even an expensive
| book.

Indeed. This project sounds so farfetched and nonsensical,
as well as blatantly illegal, that I suspect the OP is doing
research on attitudes about theft of copyrighted material.
Maybe it's a college student, maybe cops, or maybe it's
an AI bot, honing its "skills".

Paul

2024-03-14 15:39:35 UTC

Post by Newyana2
| Why should this group help a pirate? It's not even an expensive
| book.
Indeed. This project sounds so farfetched and nonsensical,
as well as blatantly illegal, that I suspect the OP is doing
research on attitudes about theft of copyrighted material.
Maybe it's a college student, maybe cops, or maybe it's
an AI bot, honing its "skills".

Did anyone write a script or commit a DMCA-punishable crime ?
Nope.

Some of us know what our limits are, from a legal perspective.
And we're not going to leave a record of our activities in
an open forum like this, now are we.

You would be surprised, just how chilling DMCA-crime is.
It's tentacles are long. It's a law you use when a DA
swings a big dick.

The interesting one, was the article in Medium.com , where
the article author claims he did a mass attack and glued
together Google "snippets" to make a book. Which is silly to start
with (it's not necessarily going to look like a book). But...
he was careful to obscure his code to carry out the procedure.
Reason. Um... Um... Gee, why would he not put his code in that
paid Medium article of his ? A normal analysis of the code, would
not reveal wrong-doing. Um... um...

This is one reason, that currently academic researchers
will no longer touch Skype, with a barge pole. To start with,
it's no longer an "exceptional" piece of work. It's less
interesting to study. But after the legal opinion given
on the study of the original Skype, the application of DMCA-crime
is now overreaching enough, that only Blackhats will be studying
the holes in the new Skype.

When is the last time you read any articles on cracking WPA3
(after the initial version was shown by cryptographers, before
it was really officially deployed, to be unsound). Well, no one
does that now.

So yes, I think the audience is vaguely aware of how much
cajoling we can do, and then... we walk away.

This is double-ungood, citizen.

This is like when the local crack dealer, leaves baggies of crack
all over neighbourhood sidewalks. If you pick up a bag, an
officer comes over and clamps the handcuffs on you. Just another
day in the neighbourhood.

Paul

Frank Slootweg

2024-03-14 16:29:44 UTC

Post by Newyana2
| Why should this group help a pirate? It's not even an expensive
| book.
Indeed. This project sounds so farfetched and nonsensical,
as well as blatantly illegal, that I suspect the OP is doing
research on attitudes about theft of copyrighted material.
Maybe it's a college student, maybe cops, or maybe it's
an AI bot, honing its "skills".

Well, we already knew that 'Arlen Holder' wants/demands all software
for free, but now he even wants to pirate copyrighted material!

Post by Newyana2
I couldn't find an out-of-print book online no matter how hard I tried.

The "out-of-print" sort of implied that it wasn't available, but the
facts show it *is* available and it's even available "online", but just
not for free - but cheap - and (AFAWK) not in electronic form.

I'm sorry that I fell for his scam.

Oliver

2024-03-15 01:34:13 UTC

Post by Newyana2
This project sounds so farfetched and nonsensical,
as well as blatantly illegal, that I suspect the OP is doing
research on attitudes about theft of copyrighted material.
Maybe it's a college student, maybe cops, or maybe it's
an AI bot, honing its "skills".

The only reason I gave that example was to point out to the VanguardLH
poster that there are books out there that have no easily available PDF.

The script is written to use any list of sites that are publicly
available on the Internet - and which are perfectly legal to use.

Those lists came from web articles on ebooks that are legit articles.
[PCMag][https://www.pcmag.com/how-to/download-free-ebooks-audiobooks-online]
[PDFTech][https://www.pdfreaderpro.com/blog/download-free-pdf-books]
[MakeUseOf][https://www.makeuseof.com/tag/download-books-for-free-from-google-books/]
[PDFGear][https://www.pdfgear.com/pdf-converter/free-ebook-download-sites.htm]

Some of those commonly available sites for ebooks are:
[Authorama][http://www.authorama.com/]
[Ebooks Free][https://www.ebooks.com/en-us/free/]
[Feedbooks][https://www.feedbooks.com/catalog/public_domain]
[Free Ebooks][https://www.free-ebooks.net/]
[Free Engineering Books][http://www.freeengineeringbooks.com/]
[Google Scholar][https://scholar.google.com/]
[Gutenberg Books][https://www.gutenberg.org/]
[Internet Book Archive][https://archive.org/]
[Manybooks Public Domain][https://manybooks.net/]
[PDFGet][https://pdfget.com/]
[Science Government Books][https://www.science.gov/]
[UPenn Online Books Page][https://digital.library.upenn.edu/books/]

Why is everyone claiming those sites are so sinister?
Have they never searched for a publicly available PDF before?

Newyana2

2024-03-15 12:36:31 UTC

"Oliver" <***@invalid.net> wrote

| The only reason I gave that example was to point out to the VanguardLH
| poster that there are books out there that have no easily available PDF.
|

Yes. There's no download offered. Google is forcing you
to see only one page at a time, and usually not the whole book.
It's not for me to say whether it's morally right to copy the books.
Was it morally right for Disney to spend millions lobbying Congress
to extend copyright way beyond what the law intends, in what
was quite literally a Mickey Mouse case? No.

But there is law. It won't help to just say, "Hey, officer, the
car was just sitting there with the door unlocked. Looks free
to me. And I think the legal owner is a crook."

As far as I can see, there are 4 general categories. There are
free, legal downloads. Those may sometimes be PDFs, or they may
be only offered in formats like DAISY, intended only for the blind.

There are *available* downloads as PDF. A place like archive.org
might have those legally, or they may be illegal uploads that
archive.org hasn't been asked to take down. Some are out of
copyright. Some are not.

There are illegal offerings, from sites that come and go, just
as sites used to offer illegal software activation keys. They may
offer a PDF but the site itself is illegal, the distribution is illegal,
and publishers try to get them shut down.

Then there's what you're talking about: Hacking the available
formats to get a copy not offered. You could probably also
hack a DAISY version, but those versions are technically only legal
for use by the blind. Google or archive.org may be legally offering
access. Some books at archive.org are available for borrowing,
for example. So they may be acting as a library. Borrowing books
from a library is legal. Copying them is not.

It's a funny system. Copyright was intended to reimburse artists
for their work, not help Disney make billions from the work of
long-dead artists. Today copyright has been interpreted more as
property rights. If libraries didn't already exist they'd surely be
illegal. Tech companies have bent the law for their own purposes.
For example, Microsoft illegally claims that their copyrighted code
is licensed to inanimate objects -- motherboards -- thus claiming
that you have no right to own the copy of Windows that you bought.
Which goes against the Macy's ruling of 1909 that says you own
your copy and that you're free to do anything with it you like,
including selling it, so long as you don't distribute copies of your copy.
Similarly, movie companies keep trying to limit access, doing things
like imposing articial "wear and tear" limits on e-books that libraries
buy.

So there's that. Illegal exploitation of the law by copyright holders.
How do Microsoft and Hollywood get away with that? They have more
money, lawyers and Congressmen than you do. The copyright law is
malleable and US law generally serves plutocracy. That's why tech
companies have become the biggest lobbyists in Congress.

How we feel about that is a separate issue from the law itself. In the
case of Google, they went to court on fair use grounds and won the right
to scan, distribute snippets and possibly rent books. Authors were
trying to claim that merely scanning was illegal. But that case was
never about Google being able to give away books under copyright.
It was only about the use of new technology for distributing snippets
of books.

https://www.smithsonianmag.com/smart-news/court-ruling-legalizes-google-books-180956997/

knuttle

2024-03-15 17:58:28 UTC

Post by Newyana2
| The only reason I gave that example was to point out to the VanguardLH
| poster that there are books out there that have no easily available PDF.
|
Yes. There's no download offered. Google is forcing you
to see only one page at a time, and usually not the whole book.
It's not for me to say whether it's morally right to copy the books.
Was it morally right for Disney to spend millions lobbying Congress
to extend copyright way beyond what the law intends, in what
was quite literally a Mickey Mouse case? No.
But there is law. It won't help to just say, "Hey, officer, the
car was just sitting there with the door unlocked. Looks free
to me. And I think the legal owner is a crook."
As far as I can see, there are 4 general categories. There are
free, legal downloads. Those may sometimes be PDFs, or they may
be only offered in formats like DAISY, intended only for the blind.
There are *available* downloads as PDF. A place like archive.org
might have those legally, or they may be illegal uploads that
archive.org hasn't been asked to take down. Some are out of
copyright. Some are not.
There are illegal offerings, from sites that come and go, just
as sites used to offer illegal software activation keys. They may
offer a PDF but the site itself is illegal, the distribution is illegal,
and publishers try to get them shut down.
Then there's what you're talking about: Hacking the available
formats to get a copy not offered. You could probably also
hack a DAISY version, but those versions are technically only legal
for use by the blind. Google or archive.org may be legally offering
access. Some books at archive.org are available for borrowing,
for example. So they may be acting as a library. Borrowing books
from a library is legal. Copying them is not.
It's a funny system. Copyright was intended to reimburse artists
for their work, not help Disney make billions from the work of
long-dead artists. Today copyright has been interpreted more as
property rights. If libraries didn't already exist they'd surely be
illegal. Tech companies have bent the law for their own purposes.
For example, Microsoft illegally claims that their copyrighted code
is licensed to inanimate objects -- motherboards -- thus claiming
that you have no right to own the copy of Windows that you bought.
Which goes against the Macy's ruling of 1909 that says you own
your copy and that you're free to do anything with it you like,
including selling it, so long as you don't distribute copies of your copy.
Similarly, movie companies keep trying to limit access, doing things
like imposing articial "wear and tear" limits on e-books that libraries
buy.
So there's that. Illegal exploitation of the law by copyright holders.
How do Microsoft and Hollywood get away with that? They have more
money, lawyers and Congressmen than you do. The copyright law is
malleable and US law generally serves plutocracy. That's why tech
companies have become the biggest lobbyists in Congress.
How we feel about that is a separate issue from the law itself. In the
case of Google, they went to court on fair use grounds and won the right
to scan, distribute snippets and possibly rent books. Authors were
trying to claim that merely scanning was illegal. But that case was
never about Google being able to give away books under copyright.
It was only about the use of new technology for distributing snippets
of books.
https://www.smithsonianmag.com/smart-news/court-ruling-legalizes-google-books-180956997/

I don't not know the original posters intentions, but if he were doing
research, he has the right to make copies of the pages that effect his
research.

Hence the need to download pages from an online book.

He can use the information found on those pages in the results of his
published research, IF he properly credits the information.

If he enters the information from those pages verbatim, without credit
he is guilty of plagiarism. in his own words the information is not
plagiarized

The exact way this works varies from country to country.

Since it could be debateable if it were in his own words, that is why we
have the courts.

Oliver

2024-03-15 20:40:03 UTC

Post by knuttle
I don't not know the original posters intentions, but if he were doing
research, he has the right to make copies of the pages that effect his
research.
Hence the need to download pages from an online book.
He can use the information found on those pages in the results of his
published research, IF he properly credits the information.
If he enters the information from those pages verbatim, without credit
he is guilty of plagiarism. in his own words the information is not
plagiarized
The exact way this works varies from country to country.
Since it could be debateable if it were in his own words, that is why we
have the courts.

Every country is different, as you said, where Wikipedia covers the
doctrine of Fair Use (particularly in school environments) over here.

https://en.wikipedia.org/wiki/Fair_use

Oliver

2024-03-15 20:40:13 UTC

- Macy's selling a product(book) at a discount below a price the
publisher claimed as a minimum price by insertion of the minimum price
notice in the book. The Supreme Court ruled in Macy's favor, later
codified in the Copyright Act of 2009, which was later repealed and
superceded by the Copyright Act of 1976 but retained content applicable
to the earlier Supreme Court's Macy's ruling.

https://www.copyright.gov/fair-use/
About Fair Use

Fair use is a legal doctrine that promotes freedom of expression by
permitting the unlicensed use of copyright-protected works in certain
circumstances. Section 107 of the Copyright Act provides the statutory
framework for determining whether something is a fair use and identifies
certain types of uses-such as criticism, comment, news reporting, teaching,
scholarship, and research-as examples of activities that may qualify as
fair use. Section 107 calls for consideration of the following four factors
in evaluating a question of fair use:

Purpose and character of the use, including whether the use is of a
commercial nature or is for nonprofit educational purposes: Courts look at
how the party claiming fair use is using the copyrighted work, and are more
likely to find that nonprofit educational and noncommercial uses are fair.
This does not mean, however, that all nonprofit education and noncommercial
uses are fair and all commercial uses are not fair; instead, courts will
balance the purpose and character of the use against the other factors
below. Additionally, "transformative" uses are more likely to be considered
fair. Transformative uses are those that add something new, with a further
purpose or different character, and do not substitute for the original use
of the work.

Nature of the copyrighted work: This factor analyzes the degree to
which the work that was used relates to copyright's purpose of encouraging
creative expression. Thus, using a more creative or imaginative work (such
as a novel, movie, or song) is less likely to support a claim of a fair use
than using a factual work (such as a technical article or news item). In
addition, use of an unpublished work is less likely to be considered fair.

Amount and substantiality of the portion used in relation to the
copyrighted work as a whole: Under this factor, courts look at both the
quantity and quality of the copyrighted material that was used. If the use
includes a large portion of the copyrighted work, fair use is less likely
to be found; if the use employs only a small amount of copyrighted
material, fair use is more likely. That said, some courts have found use of
an entire work to be fair under certain circumstances. And in other
contexts, using even a small amount of a copyrighted work was determined
not to be fair because the selection was an important part-or the
"heart"-of the work.

Effect of the use upon the potential market for or value of the
copyrighted work: Here, courts review whether, and to what extent, the
unlicensed use harms the existing or future market for the copyright
owner's original work. In assessing this factor, courts consider whether
the use is hurting the current market for the original work (for example,
by displacing sales of the original) and/or whether the use could cause
substantial harm if it were to become widespread.

In addition to the above, other factors may also be considered by a court
in weighing a fair use question, depending upon the circumstances. Courts
evaluate fair use claims on a case-bycase basis, and the outcome of any
given case depends on a fact-specific inquiry. This means that there is no
formula to ensure that a predetermined percentage or amount of a work-or
specific number of words, lines, pages, copies-may be used without
permission.

Oliver

2024-03-15 23:57:23 UTC

i.e. Fair use was not an argument in the Macy case.

Nobody said this thread is about the Macy case.
Fair use applies to this thread.

Particularly when Fair Use (in the USA) has four provisions, every one of
which must be violated without reasonable doubt for it to be infringement.

https://www.copyright.gov/fair-use/

1. Purpose and character of the use, including whether the use is of a
commercial nature or is for nonprofit educational purposes

2. Nature of the copyrighted work: This factor analyzes the degree to which
the work that was used relates to copyright's purpose of encouraging
creative expression.

3. Amount and substantiality of the portion used in relation to the
copyrighted work as a whole: Under this factor, courts look at both the
quantity and quality of the copyrighted material that was used.

4. Effect of the use upon the potential market for or value of the
copyrighted work.

All four tenets MUST be violated for it to be deemed infringement.

Not one. Not two. Not three. But all four. Appreciably so.

Every single tenet must be appreciably violated (e.g., a blurry image of a
book could be judged as not the same thing as a crisp epub of the book in
terms of #3) and the use that it is made in terms of item #1 will also be
looked at in a court of law.

Most people know none of this. I'm sure people like VanguardLH have never
even heard of the term "fair use" in this context in their whole lives.

Julian Bradfield

2024-03-16 09:45:33 UTC

Post by Oliver
Particularly when Fair Use (in the USA) has four provisions, every one of
which must be violated without reasonable doubt for it to be infringement.

Rubbish. You have no understanding of your own law.

Go and read what the law actually says. Then read the case law.

Oliver

2024-03-16 12:35:52 UTC

On Sat, 16 Mar 2024 09:45:33 +0000 (UTC), Julian Bradfield

Post by Julian Bradfield
Rubbish. You have no understanding of your own law.
Go and read what the law actually says. Then read the case law.

Rubbish? WTF? No understanding? WTF? I cited well-known USA laws.
From the government copyright site.

For you to call cites to USA laws "Rubbish" speaks worlds about you.
https://www.copyright.gov/fair-use/

Your unilateral declaration that all US copyright laws on Fair Use are
"Rubbish" simply means either you viscerally dislike USA copyright law, or,
you were initially ignorant of USA Fair Use laws and you are shocked.

Most likely it's the latter as I've long known (for decades) that stupid
people have no concept that laws such as Fair Use even exist in the USA.

Yet, Fair Use is well established in the USA even if you never heard of it
prior to this thread, where you're shocked that the concept even exists.

From your wording & NNTP posting host, you appear to perhaps be in the UK.
Your UK laws on Fair Use can and most likely will differ.

Usually people who don't own the technical skills to comprehend something
as simple as a cite to the government description of the topic, won't have
even one tenth of the technical skills necessary to resolve technical
issues.

In this case, that technical issue is making the current process more
efficient.

Julian Bradfield

2024-03-16 19:58:35 UTC

Post by Oliver
On Sat, 16 Mar 2024 09:45:33 +0000 (UTC), Julian Bradfield

Post by Julian Bradfield
Rubbish. You have no understanding of your own law.
Go and read what the law actually says. Then read the case law.

Rubbish? WTF? No understanding? WTF? I cited well-known USA laws.

No, you didn't. You cited the four principles, without the
preceding text in the law, or the text on the page, which describe
how they are applied.

Post by Oliver

Post by Julian Bradfield
Particularly when Fair Use (in the USA) has four provisions, every one of
which must be violated without reasonable doubt for it to be infringement.

It's rubbish because that is not what the law says, as you can see for
yourself by reading either the law or the copyright page you referred to.

The rest of your post is created by replying to things I did not (and
would not) say and insulting me. There are words for that.

Oliver

2024-03-18 06:30:42 UTC

On Sat, 16 Mar 2024 19:58:35 +0000 (UTC), Julian Bradfield

Post by Julian Bradfield
The rest of your post is created by replying to things I did not (and
would not) say and insulting me. There are words for that.

Your "rubbish" comment was because you did not understand US laws.
The laws for such things are entirety laws.

Every single tenet MUST NOT exist for it to be infringement, and,
if a single tenet does exist, then it's not infringement.

Many laws work that way. Fraud for example.
You can't have half of a fraud in the USA.

Either every tenet is proven, or it's not fraud.
Same with Fair Use.

However, as I said, in your jurisdiction, the laws may be different.

But your misunderstanding of USA law does not make USA law "Rubbish".

Julian Bradfield

2024-03-18 08:14:53 UTC

Post by Oliver
On Sat, 16 Mar 2024 19:58:35 +0000 (UTC), Julian Bradfield

Post by Julian Bradfield
The rest of your post is created by replying to things I did not (and
would not) say and insulting me. There are words for that.

Your "rubbish" comment was because you did not understand US laws.
The laws for such things are entirety laws.
Every single tenet MUST NOT exist for it to be infringement, and,
if a single tenet does exist, then it's not infringement.

This is eimply false. Here is what the title says:
"In determining whether the use made of a work in any particular case
is a fair use the factors to be considered shall include—"
and then the four factors.

All these factors are gradient factors; the court does a balancing
exercise and decides, on the facts of each particular case, whether
the use is fair or not.
Libraries in the US generally work on the basis that a student can
safely copy up to 10% of a given book for educational use.

This is all explained for non-lawyers here:
https://www.copyright.gov/fair-use/index.html

Carlos E.R.

2024-03-18 14:21:16 UTC

Post by Oliver

- Macy's selling a product(book) at a discount below a price the
publisher claimed as a minimum price by insertion of the minimum price
notice in the book. The Supreme Court ruled in Macy's favor, later
codified in the Copyright Act of 2009, which was later repealed and
superceded by the Copyright Act of 1976 but retained content
applicable to the earlier Supreme Court's Macy's ruling.

https://www.copyright.gov/fair-use/
About Fair Use
Fair use is a legal doctrine that promotes freedom of expression by
permitting the unlicensed use of copyright-protected works in certain
circumstances. Section 107 of the Copyright Act provides the statutory
framework for determining whether something is a fair use and identifies
certain types of uses-such as criticism, comment, news reporting, teaching,
scholarship, and research-as examples of activities that may qualify as
fair use. Section 107 calls for consideration of the following four factors

But they don't give you the right to download an entire book for free.

--
Cheers, Carlos.

Oliver

2024-03-15 20:40:19 UTC

Post by Newyana2
| The only reason I gave that example was to point out to the VanguardLH
| poster that there are books out there that have no easily available PDF.
Yes. There's no download offered.

+1. Agreed.

The only point of picking that one specific book example was to prove to
VanguardLH (who had *insisted* that he could find a free book PDF even
though it doesn't seem to exist), so I tested out his claim. He failed.

I'm no slouch when it comes to book searches, mind you, so I knew
VanguardLH would fail - even after he declared success - he didn't
understand that the goal is creating a PDF of snapshots of any book that
doesn't have a freely available PDF download - but which has the entire
book in some kind of freely available viewing engine (like archive.org).

Post by Newyana2
Google is forcing you
to see only one page at a time, and usually not the whole book.

+1. Agreed.

I knew that before I checked but what I checked is I went back a few times,
and it's not feasible, but at least it's potentially possible to get every
page from Google Books if you visit using enough different fingerprints.

But I wouldn't do it.
The easier way is, by far, to find the freely downloadable PDF.
The next easiest way is what the subject of this thread is really about.

Post by Newyana2
It's not for me to say whether it's morally right to copy the books.

Most people are ignorant of copyright law, much like they're as ignorant of
most laws, whether they're trespassing laws, fraud laws or copyright laws.

Each of those laws has MULTIPLE tenets all of which MUST exist, in order
for it to be a crime where simply pointing to a file is NOT a crime, no
matter how much VanguardLH insists that a distributing a mere URL to a book
is a crime. Clearly people who are like VanguardLH is have absolutely no
concept of copyright law, and in particular, people who scream, as he did,
that downloading a publicly available file from a URL is illegal probably
aren't even aware that there is a concept of fair use doctrine in the USA.
https://duckduckgo.com/?t=ffab&q=copyright+law+fair+use

Post by Newyana2
Was it morally right for Disney to spend millions lobbying Congress
to extend copyright way beyond what the law intends, in what
was quite literally a Mickey Mouse case? No.

Same with cellphone companies who wanted to stop network unlocking by
claiming it was a copyright infringement to unlock the phone long ago.

Post by Newyana2
But there is law. It won't help to just say, "Hey, officer, the
car was just sitting there with the door unlocked. Looks free
to me. And I think the legal owner is a crook."

Look up Fair Use (see search link above). Every single tenet MUST exist for
it to even begin to be considered for copyright infringement.
https://www.copyright.gov/fair-use/

Post by Newyana2
As far as I can see, there are 4 general categories. There are
free, legal downloads. Those may sometimes be PDFs, or they may
be only offered in formats like DAISY, intended only for the blind.
There are *available* downloads as PDF. A place like archive.org
might have those legally, or they may be illegal uploads that
archive.org hasn't been asked to take down. Some are out of
copyright. Some are not.
There are illegal offerings, from sites that come and go, just
as sites used to offer illegal software activation keys. They may
offer a PDF but the site itself is illegal, the distribution is illegal,
and publishers try to get them shut down.
Then there's what you're talking about: Hacking the available
formats to get a copy not offered. You could probably also
hack a DAISY version, but those versions are technically only legal
for use by the blind. Google or archive.org may be legally offering
access. Some books at archive.org are available for borrowing,
for example. So they may be acting as a library. Borrowing books
from a library is legal. Copying them is not.

This is not a true statement. You can copy books if you want but you havef
to follow the four tenets of fair use doctrine (at least in the USA).
1. Purpose and character of the use
2. Nature of the copyrighted work
3. Substantiality of the portion used (note quantity & *quality*!)
4. Effect of the use upon the potential market

Those are explained further in the cite prior, where most people
fundamentally misunderstand copyright law (and fair use in particular).

Post by Newyana2
It's a funny system. Copyright was intended to reimburse artists
for their work, not help Disney make billions from the work of
long-dead artists. Today copyright has been interpreted more as
property rights. If libraries didn't already exist they'd surely be
illegal. Tech companies have bent the law for their own purposes.
For example, Microsoft illegally claims that their copyrighted code
is licensed to inanimate objects -- motherboards -- thus claiming
that you have no right to own the copy of Windows that you bought.
Which goes against the Macy's ruling of 1909 that says you own
your copy and that you're free to do anything with it you like,
including selling it, so long as you don't distribute copies of your copy.
Similarly, movie companies keep trying to limit access, doing things
like imposing articial "wear and tear" limits on e-books that libraries
buy.
So there's that. Illegal exploitation of the law by copyright holders.
How do Microsoft and Hollywood get away with that? They have more
money, lawyers and Congressmen than you do. The copyright law is
malleable and US law generally serves plutocracy. That's why tech
companies have become the biggest lobbyists in Congress.
How we feel about that is a separate issue from the law itself. In the
case of Google, they went to court on fair use grounds and won the right
to scan, distribute snippets and possibly rent books. Authors were
trying to claim that merely scanning was illegal. But that case was
never about Google being able to give away books under copyright.
It was only about the use of new technology for distributing snippets
of books.
https://www.smithsonianmag.com/smart-news/court-ruling-legalizes-google-books-180956997/

I am well aware of the google situation as we studied it in school.
I had to do a paper on it myself, so I'm familiar with the arguments.

It turns out that under the four tenets of fair use, Google was permitted
to display the book, where you have to keep in mind the quality factor is
one component of whether there is copyright infringement and also the
potential effect on the market.

It's not up to you or me but to a judge whether a barely readable blurry
non-text screenshot of already blurry book pages is going to have a
potentially huge effect on the market for an out of print book. :)

We can summarize as anyone who says it's copyright infringement doesn't
know enough about anything to help anyone so it doesn't matter that they're
wrong - it only matters that they don't own the computer skills needed.

Herbert Kleebauer

2024-03-14 11:15:35 UTC

Post by Oliver
3. https://www.google.nl/books/edition/A_Legal_History_of_the_Civil_War_and_Rec/0vB4BgAAQBAJ
The question was never about running the search.
The question was about the technical problem of
a. Screenshotting hundreds of pages of a book
b. That only shows one page at a time
c. And which is kind of blurry so it needs sharpening
d. And then it needs reassembling
e. And likely a lot of compression as a result of individual files
each with embedded fonts
Some really good answers have come about from this question already.
Thanks for your help and advice.

I don't understand why you make screen shoots. For example,
if displaying a page of the above link in Firefox, select
"Save Frame As". In the saved directory you will find
the original jpg of the page (size 1280x1978 pixel).

I suppose, if you scroll through the book, somewhere
in the browser cache all the pages should be stored,
so you can copy them all at once.

Oliver

2024-03-13 20:59:30 UTC

I believe the OP does have at least one computer with a larger screen.
I think his screen is larger than the one I've got.

You're correct my screen is larger than my refrigerator (almost) in that
it's the Sharp LC-45GD7U (the screen only of which I just measured
physically, with a ruler, to be about 36 inches wide & 22 inches tall).

I tried to flip sidewise the book that I could only borrow for one hour on
archive.org or google books where that would have helped a lot!

But I didn't know how to flip a Firefox TOR web page display sidewise.

At one time, Windows used to support "pan mode" and you could define
a virtual resolution in the MVidia "old" control panel. That
capability was removed from both NVidia and ATI/AMD interfaces
(at roughly the same time the PowerStrip developer shut down),
which tells you Microsoft wanted pan mode removed.

Ah, that's interesting. It's the one solution suggested so far that I had
not immediately thought of already - which is to see if the driver I have
supports better resolution on my dual monitor setup.

My driver is... let me look... using the (deprecated) DuMo software...
NVIDIA GeForce GTX 750 Ti driver version 31.0.15.3770

If I can "landscape" the web page view - that would add pixels immediately.
Can that be done?

Linux still has pan mode. This is from my notes file, and has some
suggestions for configuring a limited/specific set of distros for taking shots.
# Remove ugly backup picture from desktop, replace with color [Gnome3?]
gsettings set org.gnome.desktop.background picture-uri none
gsettings set org.gnome.desktop.background primary-color '#3ea5a6"
# Set virtual resolution, desktop "pans" when mouse bumps edge of screen
# Type bare "xrandr" command to get your output port names.
xrandr --output HDMI-0 --panning 1280x10000
# Dump a snapshot of the screen into the specified output file.
# The purpose of the sleep 10, is to allow the user to flip back to
# the item being captured, and say, select a menu with a mouse to compose a shot.
sleep 10 ; xwd -root -out xwd.xwdump
But on modern Windows, that's been removed. Presumably part of PVP
or its successor (where 4K BluRays are decoded in an Enclave on the CPU
and other sneaky things).

I tried to use the Adobe Acrobat (writer) capability of opening an entire
web page (every page in the web site down from a given level) but it just
ended up giving me the archive.org home page level and not the pages of the
book.

I do think there is magic there though in that capability of Adobe Acrobat
to turn an entire web site into a linked PDF (much like Paul already does
with the Windows freeware WkHtmlToPDF tool (https://wkhtmltopdf.org/).

I just have to figure out how to get Adobe Acrobat writer to use the
URL to a book and then follow through with every page of the book,
but the URL remains static for the entire book by some magic unknown to me.

That's why I'm asking for help from the experts because if it was easy, I
would have done it already and I wouldn't need to be asking for advice.

Paul

2024-03-14 05:06:43 UTC

Post by Oliver

I believe the OP does have at least one computer with a larger screen.
I think his screen is larger than the one I've got.

You're correct my screen is larger than my refrigerator (almost) in that
it's the Sharp LC-45GD7U (the screen only of which I just measured
physically, with a ruler, to be about 36 inches wide & 22 inches tall).
I tried to flip sidewise the book that I could only borrow for one hour on
archive.org or google books where that would have helped a lot!
But I didn't know how to flip a Firefox TOR web page display sidewise.

At one time, Windows used to support "pan mode" and you could define
a virtual resolution in the MVidia "old" control panel. That
capability was removed from both NVidia and ATI/AMD interfaces
(at roughly the same time the PowerStrip developer shut down),
which tells you Microsoft wanted pan mode removed.

Ah, that's interesting. It's the one solution suggested so far that I had
not immediately thought of already - which is to see if the driver I have
supports better resolution on my dual monitor setup.
My driver is... let me look... using the (deprecated) DuMo software...
NVIDIA GeForce GTX 750 Ti driver version 31.0.15.3770
If I can "landscape" the web page view - that would add pixels immediately.
Can that be done?

Linux still has pan mode. This is from my notes file, and has some
suggestions for configuring a limited/specific set of distros for taking shots.
   # Remove ugly backup picture from desktop, replace with color [Gnome3?]
   gsettings set org.gnome.desktop.background picture-uri none
   gsettings set org.gnome.desktop.background primary-color '#3ea5a6"
   # Set virtual resolution, desktop "pans" when mouse bumps edge of screen
   # Type bare "xrandr" command to get your output port names.
   xrandr --output HDMI-0 --panning 1280x10000
   # Dump a snapshot of the screen into the specified output file.
   # The purpose of the sleep 10, is to allow the user to flip back to
   # the item being captured, and say, select a menu with a mouse to compose a shot.
   sleep 10 ; xwd -root -out xwd.xwdump
But on modern Windows, that's been removed. Presumably part of PVP
or its successor (where 4K BluRays are decoded in an Enclave on the CPU
and other sneaky things).

I tried to use the Adobe Acrobat (writer) capability of opening an entire
web page (every page in the web site down from a given level) but it just
ended up giving me the archive.org home page level and not the pages of the
book.
I do think there is magic there though in that capability of Adobe Acrobat
to turn an entire web site into a linked PDF (much like Paul already does
with the Windows freeware WkHtmlToPDF tool (https://wkhtmltopdf.org/).
I just have to figure out how to get Adobe Acrobat writer to use the URL to a book and then follow through with every page of the book, but the URL remains static for the entire book by some magic unknown to me.
That's why I'm asking for help from the experts because if it was easy, I
would have done it already and I wouldn't need to be asking for advice.

You know there's a display control panel, whereby two terminals
can be placed side by side. They can really have any sort of
(x,y) offset you want, with the windows touching with little
overlap between them. One panel can be stacked on top of the
other.

Well, terminals can be rotated, too.

*******

The *best* way to do this, is with an LCD panel that has a rotation sensor
inside. The EDID is fiddled, such that when the monitor is physically
rotated, a switch closure on the rotation sensor, tells the panel it's
been rotated. The EDID tells Windows that a "new terminal is connected
and it happens to be rotated". Windows then changes the scan order
in the crossbar counters and so on, so that the pattern comes out
correctly for the new orientation. The end result is text has the
normal orientation, and the monitor is now Portrait instead of Landscape.

If, on the other hand, you rotate a terminal and it has no sensor, then
you have to manually do that in the display control panel (somehow).
Yes, I've seen four options for orientation, so there really is
a way to do it manually.

The summary is, just about anything is possible, with the display
control panel. I bet you could even turn the panels upside-down,
but there's no incentive to be doing that.

If your Display panel now reverts to the Display page of
the (stinky) Settings wheel, that's not the end of the world. When
the NVidia driver is installed, soon afterwards, the OS
mentions "we will install the NVidia control panel from the
App Store". And I end up with an NVidia icon in the tray
extension area. That panel still has the traditional graphical
representation of multiple LCD panels.

[Picture]

Loading Image...

Paul

knuttle

2024-03-13 12:48:26 UTC

Post by Oliver
I couldn't find an out-of-print book online no matter how hard I tried.
Google Books had it, of course, but it was missing pages.
So I did this procedure, which works, but the results are too blurry.
Is there a better way to snapshot a book in Archive.org displays?
3. Position the book in your web browser as big as you can get it
(Set it up as one page at a time so the right-arrow button works)
4. Start Windows Irfanview & press "c" (capture)
5. Set "Capture method" to "Hot key" to "Ctrl + Left" (or whatever)
6. Set (5) Custom rectangle/region capture
Click the top right corner (e.g., 765:223)
Click the bottom right corner (e.g., 1183:842)

I have done this many times and not just of Archive.
I do a screen capture (I use the standard PrnScr from keyboard)
I paste it in Irfanveiw and crop the book section and save the file.

Depending on the size of the book I may do the same page two or three
times. Each time maintaining the same zoom level. When I crop I use
the same index marks in the book. ie edge of page, and make sure I crop
the section between lines in the book. The idea is to line everything
up for the next step.

I save each section of the page.

When I have the complete page, I use the Image, merge image (old
Panorama) funtion to put the page back together.

Once I have the all of the pages reconstructed and saved. I then use
the Options, Multipage images, Create multipage PDF, to put the pages
back together in one file.

In some Archive.org books you have the o

Peter Johnson

2024-03-13 16:03:54 UTC

In some Archive.org books you have the option of download the book as a
pdf file,

And with some books on Google.

Oliver

2024-03-13 22:12:35 UTC

On Wed, 13 Mar 2024 16:03:54 +0000, Peter Johnson

Post by Peter Johnson

In some Archive.org books you have the option of download the book as a
pdf file,

And with some books on Google.

I found out something interesting about Google Books when I just ran a test
on the book that I picked as a sample for Vanguard since he asked for a
title.

https://books.google.com/books/about/A_Legal_History_of_the_Civil_War_and_Rec.html?id=0vB4BgAAQBAJ
(Is there a way to shorten that link? Let me experiment a bit.)

Oh good.
It's like Amazon where all you need is the B0x...x to construct a link.
https://books.google.com/?id=0vB4BgAAQBAJ
https://books.google.com/books?id=0vB4BgAAQBAJ

I used that link with two different VPNs at the same time, and a
different set of pages showed up, which supports the theory but doesn't
prove that if you return to the Google Books link frequently, you might
just get all the pages.

But it's tedious at best.

Paul

2024-03-14 05:42:51 UTC

Post by Oliver
On Wed, 13 Mar 2024 16:03:54 +0000, Peter Johnson

Post by Peter Johnson

In some Archive.org books you have the option of download the book as a pdf file,

And with some books on Google.

I found out something interesting about Google Books when I just ran a test
on the book that I picked as a sample for Vanguard since he asked for a
title.
https://books.google.com/books/about/A_Legal_History_of_the_Civil_War_and_Rec.html?id=0vB4BgAAQBAJ
(Is there a way to shorten that link? Let me experiment a bit.)
Oh good. It's like Amazon where all you need is the B0x...x to construct a link.
https://books.google.com/?id=0vB4BgAAQBAJ
https://books.google.com/books?id=0vB4BgAAQBAJ
I used that link with two different VPNs at the same time, and a different set of pages showed up, which supports the theory but doesn't
prove that if you return to the Google Books link frequently, you might
just get all the pages.
But it's tedious at best.

It's 212 pages.

The theory:

https://en.wikipedia.org/wiki/Google_Books

The practice:

https://medium.com/@designing/how-i-hacked-google-books-missing-pages-f2d85289ca26

(Well, he didn't really, he used "snippets" to reconstitute a page, which
isn't going to look like a page exactly. And it would take a kazillion
queries to build an entire book, which would not necessarily look
like the original pages, and would have all the OCR errors.)

Paul

Oliver

2024-03-15 01:31:13 UTC

Post by Oliver
But it's tedious at best.

It's 212 pages.

A book could be any number of pages. There's no magic in the number 212.

The script is designed for any book that you need to find, usually to quote
a sentence or two in papers, where fair use is the domain it fits within.

That specific book was simply an example that I already had on my shelf.

It wasn't supposed to be the actual set of books that anyone would want in
the future. It's just one book. It wasn't supposed to anything else.

Just one example of one book that there is no easily available PDF for,
but which is legally available to borrow on archive.org & Google Books.

The script is written for any freely available book on the Internet.
That people ascribe sinister intent is their own minds fabricating it.

The web sites actually came from simple googling for how to legally get
free books PDFs on the web, some of which are these sites listed below.

The only reason I gave that example was to point out to the VanguardLH
poster that there are books out there that have no easily available PDF.

[PCMag][https://www.pcmag.com/how-to/download-free-ebooks-audiobooks-online]
[PDFTech][https://www.pdfreaderpro.com/blog/download-free-pdf-books]
[MakeUseOf][https://www.makeuseof.com/tag/download-books-for-free-from-google-books/]
[PDFGear][https://www.pdfgear.com/pdf-converter/free-ebook-download-sites.htm]

Some of those commonly available sites for ebooks are:
[Authorama][http://www.authorama.com/]
[Ebooks Free][https://www.ebooks.com/en-us/free/]
[Feedbooks][https://www.feedbooks.com/catalog/public_domain]
[Free Ebooks][https://www.free-ebooks.net/]
[Free Engineering Books][http://www.freeengineeringbooks.com/]
[Google Scholar][https://scholar.google.com/]
[Gutenberg Books][https://www.gutenberg.org/]
[Internet Book Archive][https://archive.org/]
[Manybooks Public Domain][https://manybooks.net/]
[PDFGet][https://pdfget.com/]
[Science Government Books][https://www.science.gov/]
[UPenn Online Books Page][https://digital.library.upenn.edu/books/]

Why is everyone claiming those sites are so sinister?
Have they never used any of them?

Carlos E.R.

2024-03-18 14:32:34 UTC

Post by Oliver

Post by Oliver
But it's tedious at best.

It's 212 pages.

A book could be any number of pages. There's no magic in the number 212.
The script is designed for any book that you need to find, usually to quote
a sentence or two in papers, where fair use is the domain it fits within.

Yes, but that doesn't entitle you to download for free the entire book.

You are entitled to read it online at a library, in a way similar to a
city or university library. You borrow the book, you can not copy it
legally.

Some can excuse the copy if the book is out of print and no one sells an
electronic copy.

--
Cheers, Carlos.

Oliver

2024-03-13 21:21:06 UTC

Post by knuttle

Post by Oliver
4. Start Windows Irfanview & press "c" (capture)
5. Set "Capture method" to "Hot key" to "Ctrl + Left" (or whatever)
6. Set (5) Custom rectangle/region capture
�� Click the top right corner (e.g., 765:223)
�� Click the bottom right corner (e.g., 1183:842)

I have done this many times and not just of Archive.
I do a screen capture (I use the standard PrnScr from keyboard)
I paste it in Irfanveiw and crop the book section and save the file.

Thanks Keith, as I think I learned this Irfanview screen-capture method
from you (or maybe it was Paul, I don't remember as it was long ago).

I also learned from this newsgroup that the standard "control+y) will crop
inside of Irfanview, but MUCH BETTER is the non-standard "control+shift+y".

That will crop out all white space, or in reality, all monotone colors on
the outside of the image, which works especially well for displays such as
that of books where there is usually a clear delineation between the book
pages and the web page background as part of the Google Books or
Archive.org (or whatever web site) GUI that is displaying the book pages.

But in this case, since archive.org (and Google Books) displayed the pages
in EXACTLY the same place, using the Irfanview fixed rectangle works well.

If we take an arbitrary example using the script I posted elsewhere...
<https://www.novabbs.com/computers/article-flat.php?id=77704&group=alt.comp.os.windows-10#77704>
Looking for this item which I have in my hands as an example:
"A legal history of the civil war and reconstruction: A nation of rights,
by Laura F. Edwards, 2015, 1107008794, 978-1107008793

Here are some links that are found just now using the script listed above.
https://archive.org/details/legalhistoryofci0000edwa
https://openlibrary.org/search (finds item 0vB4BgAAQBAJ)
https://books.google.com/books/about/A_Legal_History_of_the_Civil_War_and_Rec.html?id=0vB4BgAAQBAJ
https://books.google.com/books?id=9asPBgAAQBAJ&source=gbs_book_other_versions
And so on.

Each of those presents the pages of that book in the same spot.
Which makes it easy to use the static rectangle Irfanview method.

The problem is that the text is blurry from Google Books
(which is also missing pages so you have to re-visit using VPNs).

But worse is the archive.org that has the whole book, but every page
is blurry.

I tried to snap JPEGs and then sharpen them and then save to PDF,
but I didn't try TIFF yet. Do you think TIFF can be sharpened better?

Is there a sharpen technique for PDF pages?

Post by knuttle
Depending on the size of the book I may do the same page two or three
times. Each time maintaining the same zoom level. When I crop I use
the same index marks in the book. ie edge of page, and make sure I crop
the section between lines in the book. The idea is to line everything
up for the next step.
I save each section of the page.

That's a good idea if we can find a way to automate that for potentially
three hundred pages times three or four zoom levels per page.

Assuming we zoom so that only 1/3rd the page is showing at a time,
we could snap, oh, say, 100 pages at that zoom level.

Then we could re-do those 100 pages lowering the zoom to the middle third.
And then re-doing it the third time on the bottom 1/3rd of those 100 pages.

That would work if Irfanview has a batch "Merge" feature (vertical merge).
But I can't find it in the advanced GUI options when I press "b" in IV.

Post by knuttle
When I have the complete page, I use the Image, merge image (old
Panorama) funtion to put the page back together.
Once I have the all of the pages reconstructed and saved. I then use
the Options, Multipage images, Create multipage PDF, to put the pages
back together in one file.

I've been using the abc functions of Irfanview for years (a=about, b=batch,
and c=capture) where the IV merge is powerful - but can it be batched?

Is there a batch merge feature where we can take 300 shots of the same 100
pages, the first 100 are the top 1/3rd of the page, the second 100 are the
middle 1/3rd of the page, and the third 100 is the bottom 1/3 of the page?

Can Irfanview be scripted to merge files that way?
That might work. Good idea. If it's possible to batch the process.

It would be a general purpose solution that triples the resolution, so to
speak, right?

Post by knuttle
In some Archive.org books you have the option of download the book as a
pdf file,

I understand. I agree. But often they are not, as in the sample I provided
above for ISBN 978-1107008793 but even so the question is seeking a general
purpose clever solution to the basic problem of screenshot resolution.

knuttle

2024-03-13 23:51:31 UTC

Post by Oliver

Post by knuttle

Post by Oliver
4. Start Windows Irfanview & press "c" (capture)
5. Set "Capture method" to "Hot key" to "Ctrl + Left" (or whatever)
6. Set (5) Custom rectangle/region capture
�� Click the top right corner (e.g., 765:223)
�� Click the bottom right corner (e.g., 1183:842)

I have done this many times and not just of Archive.
I do a screen capture (I use the standard PrnScr from keyboard)
I paste it in Irfanveiw and crop the book section and save the file.

Thanks Keith, as I think I learned this Irfanview screen-capture method
from you (or maybe it was Paul, I don't remember as it was long ago).
I also learned from this newsgroup that the standard "control+y) will crop
inside of Irfanview, but MUCH BETTER is the non-standard "control+shift+y".
That will crop out all white space, or in reality, all monotone colors on
the outside of the image, which works especially well for displays such as
that of books where there is usually a clear delineation between the book
pages and the web page background as part of the Google Books or
Archive.org (or whatever web site) GUI that is displaying the book pages.
But in this case, since archive.org (and Google Books) displayed the pages
in EXACTLY the same place, using the Irfanview fixed rectangle works well.
If we take an arbitrary example using the script I posted elsewhere...
<https://www.novabbs.com/computers/article-flat.php?id=77704&group=alt.comp.os.windows-10#77704>
"A legal history of the civil war and reconstruction: A nation of
rights, by Laura F. Edwards, 2015, 1107008794, 978-1107008793
Here are some links that are found just now using the script listed above.
https://archive.org/details/legalhistoryofci0000edwa
https://openlibrary.org/search (finds item 0vB4BgAAQBAJ)
https://books.google.com/books/about/A_Legal_History_of_the_Civil_War_and_Rec.html?id=0vB4BgAAQBAJ
https://books.google.com/books?id=9asPBgAAQBAJ&source=gbs_book_other_versions
And so on.
Each of those presents the pages of that book in the same spot.
Which makes it easy to use the static rectangle Irfanview method.
The problem is that the text is blurry from Google Books (which is also
missing pages so you have to re-visit using VPNs).
But worse is the archive.org that has the whole book, but every page
is blurry.
I tried to snap JPEGs and then sharpen them and then save to PDF, but I
didn't try TIFF yet. Do you think TIFF can be sharpened better?
Is there a sharpen technique for PDF pages?

Post by knuttle
Depending on the size of the book I may do the same page two or three
times. Each time maintaining the same zoom level. When I crop I use
the same index marks in the book. ie edge of page, and make sure I
crop the section between lines in the book. The idea is to line
everything up for the next step.
I save each section of the page.

That's a good idea if we can find a way to automate that for potentially
three hundred pages times three or four zoom levels per page.
Assuming we zoom so that only 1/3rd the page is showing at a time, we
could snap, oh, say, 100 pages at that zoom level.
Then we could re-do those 100 pages lowering the zoom to the middle
third. And then re-doing it the third time on the bottom 1/3rd of those
100 pages.
That would work if Irfanview has a batch "Merge" feature (vertical merge).
But I can't find it in the advanced GUI options when I press "b" in IV.

Post by knuttle
When I have the complete page, I use the Image, merge image (old
Panorama) funtion to put the page back together.
Once I have the all of the pages reconstructed and saved. I then use
the Options, Multipage images, Create multipage PDF, to put the pages
back together in one file.

I've been using the abc functions of Irfanview for years (a=about, b=batch,
and c=capture) where the IV merge is powerful - but can it be batched?
Is there a batch merge feature where we can take 300 shots of the same 100
pages, the first 100 are the top 1/3rd of the page, the second 100 are the
middle 1/3rd of the page, and the third 100 is the bottom 1/3 of the page?
Can Irfanview be scripted to merge files that way? That might work. Good
idea. If it's possible to batch the process.
It would be a general purpose solution that triples the resolution, so to
speak, right?

Post by knuttle
In some Archive.org books you have the option of download the book as
a pdf file,

I understand. I agree. But often they are not, as in the sample I provided
above for ISBN 978-1107008793 but even so the question is seeking a general
purpose clever solution to the basic problem of screenshot resolution.

I don't know how heavy an Irfanview user you are but there is a function
to automate some processes on several files.

This is the deceptively named Irfanview File, Batch conversion and
rename. While this function does what it says, By checking the advance
Option, and then Clicking Advance, does many things. (I have not
explored all) I have only used the Resize option, but there is a Crop
option, and many other things you can do in on the files you put in the
windows.

I did not see any way to move to another section of a page or to advance
to another URL.

One other thing, it is possible that mouse dribble has changed some
setting that cause the blurred letters. Have you changed the font
settings in the browser, or in the Windows OS? The refresh rate can
sometimes do amazing things.

Try setting the monitor and

Peter Jason

2024-03-15 21:57:20 UTC

Have you tried Internet Archive?

https://archive.org/

Oliver

2024-03-15 23:34:34 UTC

Post by Peter Jason
Have you tried Internet Archive?
https://archive.org/

Yes. Every known publicly freely available Internet book archive
is expected to be included into the links.txt file (which will later be
improved to ask which browser and to run the search on each tab).

@echo off
set LINKS=links.txt
set BROWSER=firefox

FOR /F %%i in (%LINKS%) do start %BROWSER% -new-tab %%i

set LINKS=
set BROWSER=

I've started testing the ability to switch to any desired browser.
@echo off
echo "Open web browser to a variety of book sites"
echo USAGE:
echo -Press "1" to open book sites in the Firefox web browser
echo -Press "2" to open book sites in the Chrome web browser
echo -Press "3" to open book sites in the Internet Explorer web browser
echo -Press "x" to exit.
echo.
set /p option=Your option:
if '%option%'=='1' goto :option1
if '%option%'=='2' goto :option2
if '%option%'=='3' goto :option3
if '%option%'=='x' goto :exit
echo Enter browser 1, 2, 3 or x

The current links.txt file contains the following:
"http://www.authorama.com/"
"https://annas-archive.org/"
"https://archive.org/"
"https://bitsearch.to/"
"https://digital.library.upenn.edu/books/"
"https://libgen.is/"
"https://libgen.li/"
"https://libgen.rs/"
"https://libgen.st/"
"https://librarygenesis.net/"
"https://librivox.org/"
"https://manybooks.net/"
"https://openaccessbutton.org/"
"https://openlibrary.org/"
"https://pdfget.com/"
"https://pdfgrab.com/"
"https://scholar.google.com/"
"https://sci-hub.ru/"
"https://unpaywall.org"
"https://www.academia.edu"
"https://www.base-search.net/"
"https://www.ebooks.com/en-us/free/"
"https://www.feedbooks.com/catalog/public_domain"
"https://www.free-ebooks.net/"
"https://www.freeengineeringbooks.com/"
"https://www.gutenberg.org/"
"https://www.readanybook.com/"
"https://www.researchgate.net/"
"https://www.science.gov/"
"https://www.tandfonline.com/"
"https://zlibrary.to/"

Do you know of other common ebook archives which contain
legally available free public PDFs that have been missed?

Peter Jason

2024-03-16 05:58:28 UTC

Post by Oliver

Post by Peter Jason
Have you tried Internet Archive?
https://archive.org/

Yes. Every known publicly freely available Internet book archive
is expected to be included into the links.txt file (which will later be
improved to ask which browser and to run the search on each tab).
@echo off
set LINKS=links.txt
set BROWSER=firefox
FOR /F %%i in (%LINKS%) do start %BROWSER% -new-tab %%i
set LINKS=
set BROWSER=
I've started testing the ability to switch to any desired browser.
@echo off
echo "Open web browser to a variety of book sites"
echo -Press "1" to open book sites in the Firefox web browser
echo -Press "2" to open book sites in the Chrome web browser
echo -Press "3" to open book sites in the Internet Explorer web browser
echo -Press "x" to exit.
echo.
if '%option%'=='1' goto :option1
if '%option%'=='2' goto :option2
if '%option%'=='3' goto :option3
if '%option%'=='x' goto :exit
echo Enter browser 1, 2, 3 or x
"http://www.authorama.com/"
"https://annas-archive.org/"
"https://archive.org/"
"https://bitsearch.to/"
"https://digital.library.upenn.edu/books/"
"https://libgen.is/"
"https://libgen.li/"
"https://libgen.rs/"
"https://libgen.st/"
"https://librarygenesis.net/"
"https://librivox.org/"
"https://manybooks.net/"
"https://openaccessbutton.org/"
"https://openlibrary.org/"
"https://pdfget.com/"
"https://pdfgrab.com/"
"https://scholar.google.com/"
"https://sci-hub.ru/"
"https://unpaywall.org"
"https://www.academia.edu"
"https://www.base-search.net/"
"https://www.ebooks.com/en-us/free/"
"https://www.feedbooks.com/catalog/public_domain"
"https://www.free-ebooks.net/"
"https://www.freeengineeringbooks.com/"
"https://www.gutenberg.org/"
"https://www.readanybook.com/"
"https://www.researchgate.net/"
"https://www.science.gov/"
"https://www.tandfonline.com/"
"https://zlibrary.to/"
Do you know of other common ebook archives which contain
legally available free public PDFs that have been missed?

Thanks for the info.

I can think of one other site with hi-quality pdfs.....
https://www.globalgreyebooks.com/index.html

45 Replies
18 Views
Permalink to this page
Disable enhanced parsing

Thread Navigation

Oliver 2024-03-13 05:37:21 UTC

VanguardLH 2024-03-13 07:13:22 UTC

Oliver 2024-03-13 09:00:26 UTC

Oliver 2024-03-13 19:14:13 UTC

Frank Slootweg 2024-03-13 19:52:58 UTC

Oliver 2024-03-13 21:48:41 UTC

Carlos E.R. 2024-03-18 14:23:38 UTC

Paul 2024-03-14 05:14:08 UTC

VanguardLH 2024-03-13 20:59:18 UTC

Oliver 2024-03-13 22:02:18 UTC

VanguardLH 2024-03-14 00:00:02 UTC

knuttle 2024-03-14 00:05:55 UTC

VanguardLH 2024-03-14 00:59:44 UTC

Oliver 2024-03-14 01:32:53 UTC

Carlos E.R. 2024-03-18 14:15:29 UTC

Julian Bradfield 2024-03-14 10:41:37 UTC

Newyana2 2024-03-14 11:39:15 UTC

Paul 2024-03-14 15:39:35 UTC

Frank Slootweg 2024-03-14 16:29:44 UTC

Oliver 2024-03-15 01:34:13 UTC

Newyana2 2024-03-15 12:36:31 UTC

knuttle 2024-03-15 17:58:28 UTC

Oliver 2024-03-15 20:40:03 UTC

Oliver 2024-03-15 20:40:13 UTC

Oliver 2024-03-15 23:57:23 UTC

Julian Bradfield 2024-03-16 09:45:33 UTC

Oliver 2024-03-16 12:35:52 UTC

Julian Bradfield 2024-03-16 19:58:35 UTC

Oliver 2024-03-18 06:30:42 UTC

Julian Bradfield 2024-03-18 08:14:53 UTC

Carlos E.R. 2024-03-18 14:21:16 UTC

Oliver 2024-03-15 20:40:19 UTC

Herbert Kleebauer 2024-03-14 11:15:35 UTC

Oliver 2024-03-13 20:59:30 UTC

Paul 2024-03-14 05:06:43 UTC

knuttle 2024-03-13 12:48:26 UTC

Peter Johnson 2024-03-13 16:03:54 UTC

Oliver 2024-03-13 22:12:35 UTC

Paul 2024-03-14 05:42:51 UTC

Oliver 2024-03-15 01:31:13 UTC

Carlos E.R. 2024-03-18 14:32:34 UTC

Oliver 2024-03-13 21:21:06 UTC

knuttle 2024-03-13 23:51:31 UTC

Peter Jason 2024-03-15 21:57:20 UTC

Oliver 2024-03-15 23:34:34 UTC

Peter Jason 2024-03-16 05:58:28 UTC

about - legalese

Loading...