gocolly / colly
- вторник, 19 декабря 2017 г. в 10:12:31
Fast and Elegant Scraping Framework for Golang
Lightning Fast and Elegant Scraping Framework for Gophers
Colly provides a clean interface to write any kind of crawler/scraper/spider.
With Colly you can easily extract structured data from websites, which can be used for a wide range of applications, like data mining, data processing or archiving.
func main() {
c := colly.NewCollector()
// Find and visit all links
c.OnHTML("a[href]", func(e *colly.HTMLElement) {
e.Request.Visit(e.Attr("href"))
})
c.OnRequest(func(r *colly.Request) {
fmt.Println("Visiting", r.URL)
})
c.Visit("https://en.wikipedia.org/")
}
See examples folder for more detailed examples.
go get -u github.com/gocolly/colly/...
Bugs or suggestions? Visit the issue tracker or join #colly
on freenode
Below is a list of public, open source projects that use Colly:
If you are using Colly in a project please send a pull request to add it to the list.
This project exists thanks to all the people who contribute. [Contribute].
Thank you to all our backers!
Support this project by becoming a sponsor. Your logo will show up here with a link to your website. [Become a sponsor]