I've seen this on every Yahoo! news page, at the bottom of the source code,
and failed to understand why they break the script word like that.
Does anybody know if there's any reason for this?
Consider this simplified example:
<script> document.write("something </script> something"); </script>
The browser's HTML parser would see the
<script> element ends at the next
(It also knows that you can't have nested
<script> elements, hence the breaking of the opening
<script> as well as the closing
</script> in your example.)
Suppose you are writing a tool that detects the beginning and end of script blocks in a chunk of text. Suppose you see
<blah><blahdeblah><script> blah blah blah blah print("</script>") print("<script>") blah </script> </blahdeblah></blah>
Without knowing the syntax of the script language, how does your tool know that this is ONE script block and not TWO script blocks with ")blah between them?
A web browser is such a tool. It's a reasonable practice to make sure you never confuse the web browser by never having
</script> in your file unless it actually is a script tag.
so that it doesn't get evaluated but gets inserted as a string.
It's a bad way to prevent XML/XHTML and HTML validators from yelling at the source code.
For a full discussion of this, see:
The short answer is that your code is parsed in two discrete steps.
By splitting up the tag with a string concatenate operator you prevent a constant from tripping up the XML phase.
©2020 All rights reserved.