Add support for parsing Markdown inside HTML #38

adam-fowler · 2020-01-10T22:20:16Z

See CommonMark example 108 https://spec.commonmark.org/0.18/#example-108.

Added a GroupFragment protocol that can hold multiple fragments. The HTML fragment conforms to it, as it needs to hold its HTML plus any Markdown fragments included inside the HTML block.
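To illustrate the idea of a fragment that holds other fragments, here is a minimal sketch. Only the name GroupFragment comes from this PR; the Fragment protocol, its html() requirement, and the RawHTML, Emphasis, and HTMLBlock types are assumptions made up for the example and are not Ink's internal API.

```swift
// Hypothetical sketch: these types are illustrative, not Ink's internals.
protocol Fragment {
    func html() -> String
}

// A fragment that owns an ordered list of child fragments.
protocol GroupFragment: Fragment {
    var fragments: [Fragment] { get }
}

extension GroupFragment {
    // Render each child in order and concatenate the results.
    func html() -> String {
        fragments.map { $0.html() }.joined()
    }
}

// Raw HTML that is passed through untouched.
struct RawHTML: Fragment {
    var text: String
    func html() -> String { text }
}

// A stand-in for a parsed Markdown fragment.
struct Emphasis: Fragment {
    var text: String
    func html() -> String { "<em>\(text)</em>" }
}

// An HTML block made of raw HTML plus nested Markdown fragments.
struct HTMLBlock: GroupFragment {
    var fragments: [Fragment]
}

let block = HTMLBlock(fragments: [
    RawHTML(text: "<div>"),
    Emphasis(text: "foo"),
    RawHTML(text: "</div>")
])
print(block.html()) // prints "<div><em>foo</em></div>"
```

The default html() in the protocol extension means a conforming type only has to supply its children.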

HTML parsing sets the state possibleMarkdown whenever it encounters two newlines in a row. At that point it checks for Markdown characters and tries to parse them.
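The blank-line check can be sketched roughly as follows. This is a simplification, not the parser code in this PR: it splits the block at double newlines up front instead of tracking state while reading, and the Segment enum, the markdownStarters set, and the segments(in:) function are all hypothetical names chosen for the example.

```swift
import Foundation

// Hypothetical sketch of the blank-line check; not Ink's actual parser.
enum Segment: Equatable {
    case html(String)
    case possibleMarkdown(String)
}

// Characters assumed (for this sketch only) to signal the start of Markdown.
let markdownStarters: Set<Character> = ["*", "_", "#", "!", "[", "`", "-", ">"]

func segments(in block: String) -> [Segment] {
    // Two newlines in a row mark a point where Markdown may begin.
    let parts = block.components(separatedBy: "\n\n")
    var result: [Segment] = []

    for (index, part) in parts.enumerated() where !part.isEmpty {
        // The first part is the opening HTML. Later parts count as
        // possible Markdown only if they start with a Markdown character.
        if index > 0, let first = part.first, markdownStarters.contains(first) {
            result.append(.possibleMarkdown(part))
        } else {
            result.append(.html(part))
        }
    }
    return result
}

let input = "<div class=\"myimageclass\">\n\n![My image](image.jpg)\n\n</div>"
for segment in segments(in: input) {
    print(segment)
}
```

For the div example in this description, the image line is the only segment flagged as possible Markdown; the opening and closing tags stay as HTML.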

Check out the tests I added for examples of Markdown inside HTML.

I specifically added this so I could wrap images in a div with a class attribute, as follows, and then style them with CSS.

<div class="myimageclass">

![My image](image.jpg)

</div>

john-mueller

Hi @adam-fowler, thanks for putting this together! I'm hoping @JohnSundell has time to look at this soon, but I found a couple of things that need to be fixed for this to be merged.

First, could you rebase this on master, so that it will merge nicely with other pending PRs?

Second, there is a string index out-of-bounds error I found when running this against the CommonMark test suite. I added a guard statement in the review comments. I also added a couple nitpicky formatting details based on previous comments by John.

With those two changes, this PR causes CommonMark tests 122, 125, and 157 to pass, with no newly failing tests. Being able to wrap elements in styling <div> tags will be great!

john-mueller · 2020-03-26T10:56:41Z

Sources/Ink/Internal/HTML.swift

+                    possibleMarkdown = true
+                }
+            }
+


Suggested change

guard !reader.didReachEnd else { break }

The following input (example 125 in the current CommonMark spec) causes an out-of-bounds error:

<div>
*foo*

*bar*

This guard statement fixes the issue, and all other current spec tests run without crashing.

I had to put the guard statement slightly further up to get it to work, i.e. above the double newline check.

Oops, that's where it was supposed to be.
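For context on why the guard matters, here is a minimal sketch of the failure mode. The Reader below is a hypothetical stand-in written for this example (Ink's real reader has a richer interface), and countNewlines(in:) is an invented helper, not a function from this PR.

```swift
// Hypothetical minimal reader; not Ink's actual Reader type.
struct Reader {
    let string: String
    var index: String.Index

    init(_ string: String) {
        self.string = string
        self.index = string.startIndex
    }

    var didReachEnd: Bool { index == string.endIndex }

    // Reading at endIndex traps, so callers must check didReachEnd first.
    var currentCharacter: Character { string[index] }

    mutating func advance() { index = string.index(after: index) }
}

func countNewlines(in text: String) -> Int {
    var reader = Reader(text)
    var count = 0

    while true {
        // Without this guard, input that ends before a closing tag
        // would be indexed past its end and crash.
        guard !reader.didReachEnd else { break }

        if reader.currentCharacter == "\n" { count += 1 }
        reader.advance()
    }
    return count
}

// The unclosed <div> from example 125: the loop stops cleanly at the end.
print(countNewlines(in: "<div>\n*foo*\n\n*bar*")) // prints 3
```

Placing the guard at the top of the loop body, before any character is read, mirrors the point made above about moving it ahead of the double newline check.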

john-mueller · 2020-03-26T11:16:18Z

Tests/InkTests/HTMLTests.swift

        XCTAssertEqual(html, "<p>Hello</p><br/><p>World</p>")
    }
-
+    


Suggested change

Picky, but I think John prefers not to have this indentation.

john-mueller · 2020-03-26T11:21:39Z

Tests/InkTests/HTMLTests.swift

+
+        XCTAssertEqual(html, src)
+    }
+


Suggested change

    func testUnclosedHTMLWithDoubleNewline() {
        let html = MarkdownParser().html(from: """
        <div>
        *foo*

        *bar*
        """)

        XCTAssertEqual(html, "<div>\n*foo*<p><em>bar</em></p>")
    }

Here's an extra test that catches the out-of-bounds error.

john-mueller · 2020-03-26T11:22:25Z

Tests/InkTests/HTMLTests.swift

+            ("testMarkdownBeforeHTML", testMarkdownBeforeHTML),
+            ("testMarkdownAfterHTML", testMarkdownAfterHTML),
+            ("testMultipleMarkdownInsideHTML", testMultipleMarkdownInsideHTML),
+            ("testHTMLWithDoubleNewline", testHTMLWithDoubleNewline),


Suggested change

-            ("testHTMLWithDoubleNewline", testHTMLWithDoubleNewline),
+            ("testHTMLWithDoubleNewline", testHTMLWithDoubleNewline),
+            ("testUnclosedHTMLWithDoubleNewline", testUnclosedHTMLWithDoubleNewline),

Add the extra test case.

You don't seem to have included the test. I'll add it.

john-mueller

Looks good to me!

alexito4 · 2020-03-31T19:46:27Z

I think this would help too for cases where you want to have some code hidden:

<details><summary>See the entire config</summary>
-> start code inside details ```
code
-> end code inside details ```
</details>

john-mueller suggested changes Mar 26, 2020

adam-fowler force-pushed the markdown-in-html branch from 07e6a7e to d699da9 · March 26, 2020 11:52

adam-fowler changed the title ~~Add support for parsing Markdown inside HTML~~ Add support for parsing HTML inside Markdown Mar 26, 2020

adam-fowler changed the title ~~Add support for parsing HTML inside Markdown~~ Add support for parsing Markdown inside HTML Mar 26, 2020

john-mueller approved these changes Mar 26, 2020

adam-fowler and others added 5 commits November 27, 2020 07:32

Add support for parsing Markdown inside HTML

de2d53a

See CommonMark example 108 https://spec.commonmark.org/0.18/#example-108. Added GroupFragment protocol that can hold multiple fragments as HTML fragment will now need to hold its HTML plus any Markdown fragments inside the HTML block.

Check for open ended HTML tags

c8ffe10

This fixes https://spec.commonmark.org/0.29/#example-125. Previously it was crashing

Code review: Formatting suggestions

abd8429

Co-Authored-By: John Mueller <jmuellerokc@gmail.com>

Added testUnclosedHTMLWithDoubleNewline()

e8bccb6

https://spec.commonmark.org/0.29/#example-125

Allow links in markdown inside HTML

518d58a

adam-fowler force-pushed the markdown-in-html branch from 0c1b509 to 518d58a · November 27, 2020 07:32
