I have below content in Java where I want to strip only html tags but not new line characters
test1 test2 test 3
//lin
You get a single line because text()
remove all whitepace characters.
But you can use a StringBuilder
and insert each line there:
final String html = "test1 test2 test 3
"
+ "test4
";
Document doc = Jsoup.parse(html);
StringBuilder sb = new StringBuilder();
for( Element element : doc.select("p") )
{
/*
* element.text() returns the text of this element (= without tags).
*/
sb.append(element.text()).append('\n');
}
System.out.println(sb.toString().trim());
Output:
test1 test2 test 3
test4