PHP removing html tags from string

后端未结

关注

 6  1820

I have string:

Verslo centrai Lietuvos nekilnojamojo turto plėtros asociacijos konkurse  ...

相关标签:

6条回答

感动是毒

2020-12-02 02:16
This will remove every thing - tags, ascii, line breaks but pure text:
```
strip_tags(preg_replace('/<[^>]*>/','',str_replace(array("&nbsp;","\n","\r"),"",html_entity_decode($YOUR_STRING,ENT_QUOTES,'UTF-8'))));
```
0 讨论(0)
发布评论:

提交评论
- 加载中...
温柔的废话

2020-12-02 02:18
From PHP 7.4.0 the strip_tags() alternatively accepts an array with allowable tags,

then this:
```
<?php

$html = '<div id="my-div">text<a href="#link"></a></div>';

echo strip_tags($html, ['p', 'a']); //accept p and a tags
```
Return this:
```
text<a href="#link"></a>
```
Note that only the disallowed tags have been removed.
0 讨论(0)
发布评论:

提交评论
- 加载中...
谎友^

2020-12-02 02:20
Since your HTML is not properly formatted you could choose a preg_replace() approach:
```
$text = 'Verslo centrai Lietuvos nekilnojamojo turto plėtros asociacijos konkurse ... ';
$content = preg_replace('/<[^>]*>/', '', $text); 
var_dump($content);
// string(108) "Verslo centrai Lietuvos nekilnojamojo turto plėtros asociacijos konkurse ... "
```
Codepad Example

On strip_tags() docs it says: Because strip_tags() does not actually validate the HTML, partial or broken tags can result in the removal of more text/data than expected.

Also second parameter is for $allowable_tags.
0 讨论(0)
发布评论:

提交评论
- 加载中...
余生分开走

2020-12-02 02:27
This will replace all html tags, https://regex101.com/r/jM9oS4/4
```
preg_replace('/<(|\/)(?!\?).*?(|\/)>/',$replacement,$string);
```
0 讨论(0)
发布评论:

提交评论
- 加载中...
情深已故

2020-12-02 02:33
Try to put it like that
```
$content = strip_tags($text);
```
Or you can do it with regular expression like that:
```
$content = preg_replace('/<[^>]*>/', '', $text);
```
By this $content = strip_tags($text, ''); you are allowing the  tag in the string.

For more info see the link http://php.net/manual/en/function.strip-tags.php
0 讨论(0)
发布评论:

提交评论
- 加载中...
一个人的身影

2020-12-02 02:36
Since the HTML is poorly formated you probably need to either write your own regexp to remove tags or clean up the HTML before trying to remove tags.

You could try this to remove everything that "looks like" a tag:
```
$str = preg_replace("/<.*?>/", " ", $str);
```
0 讨论(0)
发布评论:

提交评论
- 加载中...