Strange symbol shows up on website (L SEP)?

HtmlWordpressAscii

Html Problem Overview


I noticed on my website, http://www.cscc.org.sg/, there's this odd symbol that shows up.

enter image description here

It says L SEP. In the HTML Code, it display the same thing.

enter image description here

Can someone shows me how to remove them?

Html Solutions


Solution 1 - Html

That character is U+2028 or HTML entity code 
 which is a kind of newline character. It's not actually supposed to be displayed. I'm guessing that either your server side scripts failed to translate it into a new line or you are using a font that displays it.

But, since we know the HTML and UNICODE vales for the character, we can add a few lines of jQuery that should get rid of the character. Right now, I'm just replacing it with an empty space in the code below. Just add this:

$(document).ready(function() {
    $("body").children().each(function() {
        $(this).html($(this).html().replace(/
/g," "));
    });
});

This should work, though please note that I have not tested this and may not work as none of my browsers will display the character.

But if it doesn't, you can always try pasting your text block onto http://www.nousphere.net/cleanspecial.php which will remove any special characters.

Solution 2 - Html

Some fonts render LS as L SEP. Such a glyph is designed for unformatted presentations of the character, such as when viewing the raw characters of a file in a binary editor. In a formatted presentation, actual line spacing should be displayed instead of the glyph.

The problem is that neither the web server nor web browser are interpreting the LS as a newline. The web server could detect the LS and replace it with <br>. Such a feature would fit well with a web server that dynamically generates HTML anyway, but would add overhead and complexity to a web server that serves file contents without modification.

If a LS makes its way to the web browser, the web browser doesn't interpret it as formatting. Page formatting is based only on HTML tags. For example, LF and CR just affect formatting of the HTML source code, not the web page's formatting (except in <pre> sections). The browser could in principle interpret LS and PS (paragraph separator) as <br> and <p>, but the HTML standard doesn't tell browsers to do that. (It seems to me like it would be a good addition.)

To replace the raw LS character with the line separation that the content creator likely intended, you'll need to replace the LS characters with HTML markup such as <br>.

Solution 3 - Html

This is the solution for the 'strange symbol' issue.

$(document).ready(function () {
  $("body").children().each(function() {
      document.body.innerHTML = document.body.innerHTML.replace(/\u2028/g, ' ');
  });
})

Solution 4 - Html

The jquery/js solutions here work to remove the character, but it broke my Revolution Slider. I ended up doing a search replace for the character on the wp_posts tabel with Better Search Replace plugin: https://wordpress.org/plugins/better-search-replace/

When you copy paste the character from a page to the plugin box, it is invisible, but it does work. Before doing DB replaces, always have a database (or full) backup ready! And be sure to uncheck the bottom checkbox to not do a dry run with the plugin.

Attributions

All content for this solution is sourced from the original question on Stackoverflow.

The content on this page is licensed under the Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) license.

Content TypeOriginal AuthorOriginal Content on Stackoverflow
QuestionBigRedDogView Question on Stackoverflow
Solution 1 - HtmlthatguyView Answer on Stackoverflow
Solution 2 - HtmlEdward BreyView Answer on Stackoverflow
Solution 3 - HtmlRamanathanView Answer on Stackoverflow
Solution 4 - HtmlSnuwerdView Answer on Stackoverflow