Domains
The datawake is part of a larger project to improve domain-specific search. In those terms, 'domain' simply refers to a body of knowledge that a user may be interested in. Example domains could be "Computer Science Research" or "Famous Witches and Wizards".
When we refer to a domain within the datawake, what we really mean is a list of Type:Value pairs that make up our knowledge about a subject. In the default datawake environment we look at three types: phone, email, and website. A silly example domain might be:
email,myemail@gmail.com
phone,5555555555
website,http://www.mywebsite.com
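As a rough illustration, a domain written this way could be read into a set of (type, value) pairs. This is only a sketch: the function name `parse_domain` is hypothetical, and it assumes one comma-separated pair per line, as in the example above. It is not taken from the datawake code.

```python
def parse_domain(text):
    """Parse 'type,value' lines into a set of (type, value) tuples.

    Hypothetical helper; assumes one pair per line.
    """
    pairs = set()
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        # Split on the first comma only, so a value may itself contain commas.
        entry_type, _, value = line.partition(",")
        pairs.add((entry_type.strip().lower(), value.strip()))
    return pairs


example = """email,myemail@gmail.com
phone,5555555555
website,http://www.mywebsite.com"""

domain = parse_domain(example)
```

Storing the pairs in a set makes later membership checks cheap and ignores duplicate entries.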
"Extractors" can easily be added to the datawake system to produce more types. Extractors run over the content of a web page and return a list of values.
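To make the idea concrete, an extractor can be thought of as a function that takes page content and returns a list of values. The sketch below uses simple regular expressions; the function names and patterns are assumptions for illustration and do not reflect the actual datawake extractor interface.

```python
import re

# Deliberately simple patterns, chosen for illustration only.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
PHONE_RE = re.compile(r"\b\d{10}\b")  # assumes bare 10-digit numbers


def extract_emails(content):
    """Return the unique email-like strings found in the page content."""
    return sorted(set(EMAIL_RE.findall(content)))


def extract_phones(content):
    """Return the unique 10-digit numbers found in the page content."""
    return sorted(set(PHONE_RE.findall(content)))


page = "Contact myemail@gmail.com or call 5555555555."
emails = extract_emails(page)
phones = extract_phones(page)
```

A new type would follow the same shape: one more function that scans the content and returns its list of values.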
When values extracted from a page are found in your pre-defined domain, they are pulled out and highlighted, as they may be of significant interest. Additionally, the datawake provides hooks to domain-specific tools that can be used to view more information about an extracted type:value pair.
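The matching step can be pictured as a set intersection between what was extracted from a page and what the domain contains. The following is a minimal sketch under that assumption; `find_domain_hits` and the sample data are hypothetical, and how the datawake actually performs the lookup is not described here.

```python
def find_domain_hits(extracted, domain):
    """Return the extracted (type, value) pairs that also appear in the domain."""
    return sorted(set(extracted) & set(domain))


# Hypothetical domain and extraction results for one page.
domain = {
    ("email", "myemail@gmail.com"),
    ("phone", "5555555555"),
}
extracted = [
    ("email", "myemail@gmail.com"),
    ("email", "other@example.com"),
    ("phone", "1234567890"),
]

hits = find_domain_hits(extracted, domain)
```

Only the pairs in `hits` would be candidates for highlighting; everything else extracted from the page is left as ordinary content.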