“Digital typography” or my experience in mobile book digitization
3r3-31. Do you like books the way I love them
Childhood and youth spent in a small town, where in the district library from encyclopedias there was only the “Big Encyclopedic Dictionary” taught to be careful, almost reverent attitude to any technical book. I understand why people who survived the blockade kept a supply of food at home. The first time, getting access to a more or less high-speed Internet all the time wanted to download new books and save them to your hard drive, save, save :). Then came twirpx and I realized that books, like knowledge, must participate in a constant circulation, otherwise they are dead. It was worth once to scan the monograph of his supervisor and hear dozens of reviews downloaded as an avalanche could not be stopped. I noticed that today, after sharing a rare book, tomorrow I will see two, if not three rare ones that others have shared.
Part 1 , part 2
Digital “typography” A step-by-step guide to digitizing books. Part 1 , 3r333. part 2
, part 3
Digital "typography". Camera instead scanner 3r3186. Article 3r3192.
The fascination for scanning came at the time when it was just starting to fill up with 3r-351. twirpx and worked fine avaxhome . Having scanned about fifty books, algorithms gradually began to crystallize out, which would make it possible to get material convenient for reading on a 10 "tablet (not to mention a computer monitor) of sufficiently high quality and at the same time save time spent on processing one book. 3r3197.
Honestly, several times I really wanted to make a real book scanner, like the one described on Habré (3r-357. Book scanner with your own hands 3r3192.), Or even better, such as made cool German Daddy (video 3r3-359. Part 1
, .2 ,
? part 3
). But thoughts about homemade products are visited when there is a lot of free time for reflection (and material, and tools, etc., etc.). More often, all this is not at hand, but a book is needed. And we need it urgently, and even in acceptable quality.
Therefore, for quite some time now I have been using an uncomplicated software and hardware complex, which allows me to create fairly high-quality copies of books in a short time. For example, it takes about an hour to process one 300 page book (starting from photographing and finishing with coding in djvu) using a PC based on AMD Athlon II X???/16 Gb RAM /4 Tb SATA 3.0 HDD.
3r3173. The same, but shot from a different angle :)
The following items are included in the gentleman's set of mobile digital book printer:
1) Nokia PureView 808 3r3197 smartphone.
2) Movable stand-clamp
3) Mount for smartphone
4) Bluetooth remote control Coco CC-PC101
Nokia's smartphone is chosen for its reliability and maximum matrix size. Well, I love him very much :) (and on Habré he Sang the praises ). Among the shortcomings, it can be noted that, unlike Android smartphones, I had to search for a suitable remote for a long time, which would work with my phone. In the end, I settled on r3r3185. Coco CC-PC101 [/b] . Moreover, this remote control works only with the CameraPro program (the standard application does not pick it up). When using Android, any cheap remote control from Aliexpress will do.
3r3173. The principle of 'smaller book-tripod lower' principle works [/b]
A movable bar with which you can adjust the height of the smartphone above the book - the usual
selfie stick self-stick, but with the presence in the lower part of the standard 1/4 "thread for fastening to a clamp /any other stand. On aliexpress there are a lot of options for the price /parameters I liked" 3r3122. Monopod for GoPro Hero ???
The mount for the smartphone is also first available 3r3192. with 1/4 "threads, not the cheapest (unlike wire options), but I liked its shape. And so far there are no problems with it.
Stand-clamp - Soviet-made UTM LSNH. Pure duralumin, real joy for the engineer, well, just a very reliable tool with many adjustments.
My smartphone is quite heavy, + the weight of the telescopic bar, so I do not trust plastic Chinese clamps. But they have a place to be.
The process of photographing itself is not particularly complex. The book is positioned so as to get into the focus of the camera and with the help of the remote control focusing /shooting. Turned the pages - "focus /shooting." In this case, I try to position the book so that all edges are visible (this is necessary to align the curvature of the pages in the ScanTailor program). A few words of praise about her. Previously, I had to use either a rather capricious (often crashed with an error) and a paid program BookRestorer, or the “tongue-tied” ScanKromsator (although I’m more than sure that it will have its own fans :)). But thank God, ScanTailor has appeared, and the life of the booksellers like me has been greatly simplified. Here is what r3r3140 says. Wikipedia 3r3192. about this:
3r3195. Scan Tailor (English scan - scan, tailor - tailor) - a computer program for processing images obtained using a scanner. It is a cross-platform program and runs under Microsoft Windows, Linux and Mac OS X operating systems. The high level of the program was awarded following the results of the first contest “The Best Free Project of Russia” in 200? held by the Linux Format magazine 3r3196.
The main advantage of the program is automatic trimming, cleaning and straightening of lines. Moreover, straightening works on the same principle as the Japanese “robot for scanning books” about which they wrote on Habré (3r3148. Japanese scanner digitizes a book of 250 pages per minute 3r3192.). Let me extract from this article:
3r3195. An open book is photographed using lasers (they form a grid on the surface). In this case, photographing is done from several angles at once, after which all three frames are automatically combined. The developers claim that their method allows to avoid the distortions that usually occur with standard scanning.
The same principle is used in ScanTailor, only the layout of the marking grid on the page is regulated by the user. I align the grid along the edges of the pages (for this, they must be visible when shooting).
3r3173. Sample page without straightening lines 3r3r6186.
3r3173. Sample page using straightening lines [/b]
After the program finishes, there will be ready pages in the out folder. We load them into any DJVU converter (you can choose on
Site ). I am using 3r3185. DEE 3r3186. - Document Express Editor v??? Build 1320 LE (for NT) (Light Edition for NT) for its small size and fast work. In principle, after DEE, the book can be thrown onto your favorite reader /smartphone and used for its intended purpose. If time and effort allow, you can add an OCR layer and a table of contents. These procedures are described in detail in my article, to which I referred at the beginning of the article.
I hope my experience will be useful to all those who photograph books on the phone and then read them from pictures in the gallery :)
p.s. On Habré there was an article (3r3191. Digitizing the world book heritage with the help of smartphones 3r3192.). Where: 3r3197.
3r3195. Literu conducted several tests and found out that in this way one user, after adapting himself, would be able to digitize a 600-page book in five to ten minutes. He himself in 200? for his thesis, manually digitized thirty thousand pages of materials from more than seven hundred documents, using an ordinary digital camera and a cheap desk lamp. Most of this work was performed by Litaru within fifteen hours on one of the days off.
So this, dear Kalev to Litar, if you read Habr - write to me, maybe I can advise :))
It may be interesting
I am overwhelmed by your post with such a nice topic. Usually I visit your blogs and get updated through the information you include but today’s blog would be the most appreciable. Well done!
Took me time to understand all of the comments, but I seriously enjoyed the write-up. It proved being really helpful to me and Im positive to all of the commenters right here! Its constantly nice when you can not only be informed, but also entertained! I am certain you had enjoyable writing this write-up.