Internet Archive Book Scanning with Davide Semenzin


Update: 2020-09-15

Description


The Internet Archive collects historical records of the Internet. One of its tools you may be familiar with is the Wayback Machine. One project you may be less familiar with is book scanning: the Internet Archive scans books in high volume to digitize them.


In today’s episode, Davide Semenzin joins the show to talk through the history of the Internet Archive and the engineering behind book digitization. We talk through OCR, storage, architecture, and scalability.


Sponsorship inquiries: sponsor@softwareengineeringdaily.com


The post Internet Archive Book Scanning with Davide Semenzin appeared first on Software Engineering Daily.
