Skip to content

[Bug] Parsing lists to HTML has double bullet points #64

@piu130

Description

@piu130

When parsing lists in msgs, the resulting html (.getHTMLText()) contains the list tag as well as the bullet point in text. The browser renders a bullet point, because of the list tag and a second bullet point from the text.

Outlook:
image
Browser:
image
HTML:

<p class=MsoNormal>Stroke:<o:p></o:p></p>
<ul style='margin-top:0cm' type=disc>
  <li class=MsoListParagraph style='margin-left:0cm;mso-list:l1 level1 lfo1'>*	Stroke entry 1<o:p></o:p></li>
  <li class=MsoListParagraph style='margin-left:0cm;mso-list:l1 level1 lfo1'>*	Stroke entry 2<o:p></o:p></li>
</ul>
<p class=MsoNormal>Bullet:<o:p></o:p></p>
<ul style='margin-top:0cm' type=disc>
  <li class=MsoListParagraph style='margin-left:0cm;mso-list:l0 level1 lfo2'>*	Bullet entry 1<o:p></o:p></li>
  <li class=MsoListParagraph style='margin-left:0cm;mso-list:l0 level1 lfo2'>*	Bullet entry 2<o:p></o:p></li>
</ul>
<p class=MsoNormal>Number:<o:p></o:p></p>
<ol style='margin-top:0cm' start=1 type=1>
  <li class=MsoListParagraph style='margin-left:0cm;mso-list:l2 level1 lfo3'>1.	Number entry 1<o:p></o:p></li>
  <li class=MsoListParagraph style='margin-left:0cm;mso-list:l2 level1 lfo3'>2.	Number entry 2<o:p></o:p></li>
</ol>

We should remove the * and the 1. (and tab?) from the html text.
Otherwise we can also fix this on our side by replacing type=disc with type=none or removing the first (two) char(s) in the list.

What do you think?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions