var testString = '<body><script>alert(1);</script foo="bar">' + Array(100001).join('<div>x</div>') + '</body>';
function test_innerHTML() {
var element = document.createElement('div');
function decodeHTMLEntities(str) {
if (str && typeof str === 'string') {
// strip script/html tags
str = str.replace(/<script[^>]*>([\S\s]*?)<\/script>/gmi, '');
str = str.replace(/<\/?\w(?:[^"'>]|"[^"]*"|'[^']*')*>/gmi, '');
element.innerHTML = str;
str = element.textContent;
element.textContent = '';
}
return str;
}
return decodeHTMLEntities(testString);
}
function test_DOMParser() {
return (new DOMParser).parseFromString(testString, 'text/html').documentElement.innerText;
}
--enable-precise-memory-info
flag.
Test case name | Result |
---|---|
CreateElement_div | |
DOMParser |
Test name | Executions per second |
---|---|
CreateElement_div | 142890528.0 Ops/sec |
DOMParser | 157377296.0 Ops/sec |
The benchmark JSON provided represents a performance comparison between two methods for decoding HTML entities in a string: a custom function (decodeHTMLEntities
) and the DOMParser
API. Let's break down the elements involved:
decodeHTMLEntities
Function: This function manually processes input strings by stripping out HTML tags and decoding the contents via innerHTML
.DOMParser
: This built-in JavaScript API provides an interface to parse XML or HTML documents into a DOM Document
which we can then interact with. It is more streamlined and leverages the browser’s native parsing capabilities.decodeHTMLEntities Function:
Advantages:
Disadvantages:
innerHTML
can introduce overhead, especially in large strings.DOMParser:
Advantages:
DOMParser
abstracts away the intricacies of parsing, leading to cleaner code that is easier to understand and maintain.Disadvantages:
DOMParser
, there may be edge cases or older browsers that do not implement it flawlessly. Additionally, it might introduce security risks if not appropriately handled, particularly with untrusted content (e.g., XSS attacks).DOMParser
may be overkill and introduce overhead compared to a lightweight custom solution.When deciding between these methods, engineers should consider the context of usage. If performance is critical (e.g., processing very large text strings frequently), DOMParser
is likely the better choice. On the other hand, if fine control over parsing behavior is essential, a custom implementation may still be preferable.
Alternatives to these methods could include other parsing libraries (like htmlparser2
in Node.js environments) that might offer different trade-offs in terms of performance, safety, and ease of use.
In the benchmark results:
DOMParser
performed better with 157,377,296 executions per second versus the decodeHTMLEntities
function at 142,890,528 executions per second. This supports the expectation that using built-in browser features generally yields better performance due to optimization.This benchmark provides valuable insights into performing DOM manipulations and text processing, which can be critical in various web development scenarios.