UfXtract is an API that extracts microformats from web pages, HTML fragments, or HTML files. It can output the results as JSON, XML, or text, and it supports JSON-P for use from JavaScript. You can also download the .NET code from GitHub.
For example, the following request asks for the hCards on microformats.org, returned as JSON:
https://ufxtract.com/api/?url=http://microformats.org/&format=hcard&output=json
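As a rough illustration, a request URL of this shape could be assembled in Python as follows. The endpoint and the three parameter names (url, format, output) come from the example request; the helper name build_ufxtract_url is hypothetical, and percent-encoding the target address is standard practice assumed here, not something the API is documented to require.

```python
from urllib.parse import urlencode

# Endpoint taken from the example request above.
API_ENDPOINT = "https://ufxtract.com/api/"


def build_ufxtract_url(page_url, fmt="hcard", output="json"):
    """Return a request URL for the page to parse (hypothetical helper)."""
    # urlencode percent-encodes the target address so that its own
    # ":" and "/" characters cannot be confused with the query string.
    query = urlencode({"url": page_url, "format": fmt, "output": output})
    return API_ENDPOINT + "?" + query


request_url = build_ufxtract_url("http://microformats.org/")
print(request_url)
```

The result differs from the example request only in that the url parameter is percent-encoded.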
A page containing a single hCard returns output like the example below. The format is based on ufJson, which is documented on the microformats wiki. The API compresses the JSON output by removing all spaces and line breaks. If you would like a more readable layout, try running it through a JavaScript beautifier.
{ "microformats": { "vcard": [{ "fn": "Tantek", "nickname": ["Tantek"], "photo": ["http:\/\/www.gravatar.com\/avatar\/02cd45622e90350cc061aaaa02229195?s=16&d=http:\/\/www.gravatar.com\/avatar\/ad516503a11cd5ca435acc9bb6523536?s=16&r=PG"], "url": ["http:\/\/tantek.com\/"] }] } }
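A minimal sketch of reading a response like the one above in Python, and of re-indenting the compressed JSON without an external beautifier. The string below is an abbreviated copy of the sample (the photo property is left out for brevity), and the variable names are illustrative only.

```python
import json

# Abbreviated copy of the sample response above, in the compressed
# form described in the text (no spaces or line breaks).
compact = '{"microformats":{"vcard":[{"fn":"Tantek","nickname":["Tantek"],"url":["http:\\/\\/tantek.com\\/"]}]}}'

data = json.loads(compact)

# Walk microformats -> vcard -> first card.
card = data["microformats"]["vcard"][0]
name = card["fn"]
first_url = card["url"][0]  # the JSON escape "\/" decodes to a plain "/"

# Re-serialise with indentation for a readable layout.
pretty = json.dumps(data, indent=2)

print(name, first_url)
print(pretty)
```

In the sample, fn is a single string while nickname, photo, and url are arrays, so the sketch indexes into url to get the first value.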
UfXtract has very simple built-in error reporting. Below is an example of the response returned when the API is called with an empty URL.
{ "microformats": { "errors": [{ "msg": "Invalid URI: The hostname could not be parsed.", "url": "http:\/\/" }] } }
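One way a client might check for this kind of error report before using the results is sketched below. It assumes, based only on the example above, that errors appear as an "errors" array under "microformats" with "msg" and "url" fields; the helper name get_errors is hypothetical.

```python
import json

# Error response copied from the example above, in compressed form.
error_body = '{"microformats":{"errors":[{"msg":"Invalid URI: The hostname could not be parsed.","url":"http:\\/\\/"}]}}'


def get_errors(payload):
    """Return the list of error objects, or an empty list if there are none
    (hypothetical helper; the field names are taken from the sample response)."""
    return payload.get("microformats", {}).get("errors", [])


errors = get_errors(json.loads(error_body))
for err in errors:
    print(f"{err['url']}: {err['msg']}")
```

Returning an empty list when no errors key is present lets the caller use the same loop for successful and failed requests.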
© 2007-2010 Glenn Jones. All Rights Reserved.