Please enable JavaScript.
Coggle requires JavaScript to display documents.
Data mining methods (Web Scrapping (Extract large amounts of data from…
Data mining methods
Web Scrapping
Extract large amounts of data from websites
Automating copying data process from website
Usally done by a bot/webcrawler
Gathered data can be used for further analysis
Involves fetching it and extracting from website
XPath
A way to retrieve specified nodes within XML or HTML pages.
XML Path
One of the powerful tool for webscraping
Image Scrapping
Scrap image data from a website
Can use phyton or image scrapping tools (web plugin)
Download the image itself
HTML
Most used data mining sources
Can be used by web/text scrapping, Xpath
Website backbone
OCR
Convert text data from document into editable and searchable data
Not data mining oriented since it doesnt recognize new pattern