loading page

Parsing Millions of URLs per Second
  • Yagiz Nizipli,
  • Daniel Lemire
Yagiz Nizipli
Universite TELUQ
Author Profile
Daniel Lemire
Universite TELUQ

Corresponding Author:daniel.lemire@teluq.ca

Author Profile

Abstract

URLs are fundamental elements of web applications. By applying vector algorithms, we built a fast standard-compliant C++ implementation. Our parser uses three times fewer instructions than competing parsers following WHATWG URL standard (e.g., Servo’s rust-url) and up to eight times fewer instructions than the popular curl parser. The Node.js environment adopted our C++ library. In our tests on realistic data, a recent Node.js version (20.0) with our parser is four to five times faster than the last version with the legacy URL parser.
02 Jun 2023Submitted to Software: Practice and Experience
02 Jun 2023Submission Checks Completed
02 Jun 2023Assigned to Editor
02 Jun 2023Review(s) Completed, Editorial Evaluation Pending
20 Jun 2023Reviewer(s) Assigned
21 Aug 2023Editorial Decision: Revise Major
13 Sep 20231st Revision Received
13 Sep 2023Submission Checks Completed
13 Sep 2023Assigned to Editor
13 Sep 2023Review(s) Completed, Editorial Evaluation Pending
16 Sep 2023Reviewer(s) Assigned