Get HtmlReader to work again

wrote unit tests and documentation, improved regular expression.
The HtmlReader is enabled by default now and parses metadata in html
files of the form:
<!-- key:value -->
This commit is contained in:
Florian Jacob 2012-09-02 10:09:08 +02:00
commit 39db9ddcfd
7 changed files with 72 additions and 15 deletions

View file

@ -0,0 +1,13 @@
<!-- title: A great html article with metadata -->
<!-- tags: foo, bar, foobar -->
<!-- date: 2010-12-02 10:14 -->
<!-- category: yeah -->
<!-- author: Alexis Métaireau -->
<!-- summary:
Multi-line metadata should be supported
as well as <strong>inline markup</strong>.
-->
<!-- custom_field: http://notmyidea.org -->
<h1>This is an article in html with metadata</h1>
<p>It features very interesting insights.</p>