AWS News Blog
The AWS Report – Lisa Green of Common Crawl
In the latest episode of The AWS Report, I spoke with Lisa Green of Common Crawl to learn more about what they do and how they use AWS.
The Common Crawl data is available as an AWS Public Data Set. If you plan to process this large (81 TB) data set, you may also want to look at the Common Crawl Index and the Common Crawl Tutorial.
— Jeff;