deep inside: security and tools

@rgaucher about.me Github LinkedIn fuckthespam

HTML5 tokenization visualization

Tue 05 February 2013

Few weeks ago, I spent some time creating a specialized visualization of the HTML5 tokenizer spec. As someone who tends to spend too much time with program analysis folks, the one thing that bothers me about the HTML5 is thatthere is no grammar. Indeed, the only specification is this tokenizer.

Not only this tokenizer is a f*-long-page description, but it's also not easy to navigate. I then decided to generate a simple grammar-like visualization of this tokenizer. You can find the viz here: HTML 5 Grammar Visualization and the blog talking about it here: Blog at SRL.

Right now, I'm thinking to improve the script that extracts the information (available here) so that I can have a better capture of the tokenizer transitions, and especially the HTML entity parser.

What do you think? Any thing you'd like to see in this visualization?

All entries

February 2013 — RSA 2013 speaking session
February 2013 — HTML5 tokenization visualization
September 2011 — PHP, Variable variables, Oh my!
July 2011 — Dissection of a SQL injection challenge
January 2010 — Yes, we need a standard to evaluate SAST, but it ain't easy...
November 2009 — Data driven factory: I give you data, you give me an object...
June 2009 — NIST Static Analysis Tool Exposition special publication released
December 2008 — Every-day's CSRF: Sorry, I turned off your christmas tree lights
August 2008 — Why the "line of code" is indeed a good metric
May 2008 — Accelerate the convergence to the bug: Running the test in 16-bit
February 2008 — Code review tools: the missing link (so far)
January 2008 — Talk: Problems and solutions for testing web application security scanners
October 2007 — IE6 And IE7 don't have compatible CSS tricks
September 2007 — Source Code Obfuscation
February 2007 — The return of the SVG XSS
February 2007 — How you should design a test suite for Web Apps Scanners
January 2007 — Test Suites for Web Application Scanners
December 2006 — SVG Files: XSS attacks