December 25, 2021

Identifiers for some file formats

I managed to find a gold mine when it comes to URIs and identifiers: the DocBook notations module. DocBook is an older SGML/XML-based standard, developed by OASIS, for writing documentation, and its DTD comes with a module that “imports” several then-common notations for use with unparsed entities.

In the world of DTDs, a notation is basically any specific format of character or binary data. XML processing itself does not care about declared notations or the entities that use them, but an application might use this information. It is therefore natively possible in XML to declare that the value of a specific attribute refers to an external file and that the file should be loaded in a specific format. The feature may be cool, but letting a storage format load external files is more of a security risk, so it is not used much in XML documents.
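
To make this concrete, here is a small sketch in Python of what such declarations look like and how an application could read them. The document, the entity name logo and the file name logo.gif are made up for illustration; the notation itself is the GIF89a one listed further down. The parser only reports the declarations and never touches the external file, so what to do with it is left to the application.

```python
import xml.parsers.expat

# Hypothetical document: the "src" attribute names an unparsed entity,
# and the entity says which notation (format) the external file uses.
DOCUMENT = """<?xml version="1.0"?>
<!DOCTYPE figure [
  <!NOTATION GIF89a PUBLIC "-//CompuServe//NOTATION Graphics Interchange Format 89a//EN">
  <!ENTITY logo SYSTEM "logo.gif" NDATA GIF89a>
  <!ELEMENT figure EMPTY>
  <!ATTLIST figure src ENTITY #REQUIRED>
]>
<figure src="logo"/>
"""


def read_declarations(document):
    """Collect notation and unparsed-entity declarations from a document."""
    notations = {}  # notation name -> public identifier
    entities = {}   # entity name -> (system identifier, notation name)

    def on_notation(name, base, system_id, public_id):
        notations[name] = public_id

    def on_entity(name, is_parameter, value, base, system_id, public_id, notation):
        # Only unparsed entities carry a notation name.
        if notation is not None:
            entities[name] = (system_id, notation)

    parser = xml.parsers.expat.ParserCreate()
    parser.NotationDeclHandler = on_notation
    parser.EntityDeclHandler = on_entity
    parser.Parse(document, True)
    return notations, entities


notations, entities = read_declarations(DOCUMENT)
print(notations)
print(entities)
```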

Existing notations may, however, be used in the world of linked data, thanks to the urn:publicid: namespace. This is a sort of “free-for-all” namespace, since there is no authority that assigns these identifiers to entities or controls their structure or lifetime. The exception is the so-called Formal Public Identifiers (FPIs), whose structure may specify additional information about the entity or provide details about its ownership. RFC 3151 describes the algorithm for translating a public identifier to a URI, which is useful when the identifier happens to be an FPI.
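
The translation itself is short. The following is a minimal sketch in Python based on my reading of the transcription table in RFC 3151; it is not the code running on my website, and the function name publicid_to_urn is just a name picked for this example. Whitespace is normalized first, then the listed sequences are replaced, and every other character is passed through unchanged.

```python
import re

# Transcription table as given in RFC 3151. The two-character sequences
# "//" and "::" must be tried before the single "/" and ":".
_TRANSCRIBE = {
    "//": ":",
    "::": ";",
    "+": "%2B",
    ":": "%3A",
    "/": "%2F",
    ";": "%3B",
    "'": "%27",
    "?": "%3F",
    "#": "%23",
    "%": "%25",
    " ": "+",
}
_PATTERN = re.compile(r"//|::|[+:/;'?#% ]")


def publicid_to_urn(public_id: str) -> str:
    """Translate a public identifier into a urn:publicid: URN."""
    # Trim the identifier and collapse runs of whitespace to one space.
    normalized = " ".join(public_id.split())
    body = _PATTERN.sub(lambda match: _TRANSCRIBE[match.group()], normalized)
    return "urn:publicid:" + body


print(publicid_to_urn("ISO 8879:1986//NOTATION Standard Generalized Markup Language//EN"))
# urn:publicid:ISO+8879%3A1986:NOTATION+Standard+Generalized+Markup+Language:EN
```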

Without further ado, here are all the FPIs that I found in the module, as well as others found in other publications (like EAD), converted to URIs using the algorithm as implemented on my website.

CGM-CHAR
ISO 8632/2//NOTATION Character encoding//EN
urn:publicid:ISO+8632%2F2:NOTATION+Character+encoding:EN

CGM-BINARY
ISO 8632/3//NOTATION Binary encoding//EN
urn:publicid:ISO+8632%2F3:NOTATION+Binary+encoding:EN

CGM-CLEAR
ISO 8632/4//NOTATION Clear text encoding//EN
urn:publicid:ISO+8632%2F4:NOTATION+Clear+text+encoding:EN

EPS
+//ISBN 0-201-18127-4::Adobe//NOTATION PostScript Language Ref. Manual//EN
urn:publicid:%2B:ISBN+0-201-18127-4;Adobe:NOTATION+PostScript+Language+Ref.+Manual:EN

FAX
-//USA-DOD//NOTATION CCITT Group 4 Facsimile Type 1 Untiled Raster//EN
urn:publicid:-:USA-DOD:NOTATION+CCITT+Group+4+Facsimile+Type+1+Untiled+Raster:EN

GIF87a
-//CompuServe//NOTATION Graphics Interchange Format 87a//EN
urn:publicid:-:CompuServe:NOTATION+Graphics+Interchange+Format+87a:EN

GIF89a
-//CompuServe//NOTATION Graphics Interchange Format 89a//EN
urn:publicid:-:CompuServe:NOTATION+Graphics+Interchange+Format+89a:EN

IGES
-//USA-DOD//NOTATION (ASME/ANSI Y14.26M-1987) Initial Graphics Exchange Specification//EN
urn:publicid:-:USA-DOD:NOTATION+(ASME%2FANSI+Y14.26M-1987)+Initial+Graphics+Exchange+Specification:EN

SGML
ISO 8879:1986//NOTATION Standard Generalized Markup Language//EN
urn:publicid:ISO+8879%3A1986:NOTATION+Standard+Generalized+Markup+Language:EN

TEX
+//ISBN 0-201-13448-9::Knuth//NOTATION The TeXbook//EN
urn:publicid:%2B:ISBN+0-201-13448-9;Knuth:NOTATION+The+TeXbook:EN

JPEG
ISO/IEC 10918:1993//NOTATION Digital Compression and Coding of Continuous-tone Still Images (JPEG)//EN
urn:publicid:ISO%2FIEC+10918%3A1993:NOTATION+Digital+Compression+and+Coding+of+Continuous-tone+Still+Images+(JPEG):EN

MPEG1vid
ISO/IEC 11172-2:1993//NOTATION Information technology - Coding of moving pictures and associated audio for digital storage media at up to about 1,5 Mbit/s - Part 2: Video//EN
urn:publicid:ISO%2FIEC+11172-2%3A1993:NOTATION+Information+technology+-+Coding+of+moving+pictures+and+associated+audio+for+digital+storage+media+at+up+to+about+1%2C5+Mbit%2Fs+-+Part+2%3A+Video:EN

MPEG1aud
ISO/IEC 11172-3:1993//NOTATION Information technology - Coding of moving pictures and associated audio for digital storage media at up to about 1,5 Mbit/s - Part 3: Audio//EN
urn:publicid:ISO%2FIEC+11172-3%3A1993:NOTATION+Information+technology+-+Coding+of+moving+pictures+and+associated+audio+for+digital+storage+media+at+up+to+about+1%2C5+Mbit%2Fs+-+Part+3%3A+Audio:EN

MPEG2vid
ISO/IEC 13818-2:1995//NOTATION Information technology - Coding of moving pictures and associated audio: Part 2. Video//EN
urn:publicid:ISO%2FIEC+13818-2%3A1995:NOTATION+Information+technology+-+Coding+of+moving+pictures+and+associated+audio%3A+Part+2.+Video:EN

MPEG2aud
ISO/IEC 13818-3:1995//NOTATION Coding of moving pictures and associated audio: Part 3. Audio//EN
urn:publicid:ISO%2FIEC+13818-3%3A1995:NOTATION+Coding+of+moving+pictures+and+associated+audio%3A+Part+3.+Audio:EN

XML
ISO 8879:1986//NOTATION Extensible Markup Language (XML) 1.0//EN
urn:publicid:ISO+8879%3A1986:NOTATION+Extensible+Markup+Language+(XML)+1.0:EN

Many of the following identifiers were taken from Making Hypermedia Work: A User's Guide to HyTime. Since the book gives each of them only a name and defines them all in the same section, they share the same overall pattern.
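
That pattern is easy to take apart. As a rough sketch, assuming the usual FPI layout (an optional "+" or "-" registration indicator, the owner, the text class followed by a description, and the language, all separated by "//"), a few lines of Python are enough. The FPI tuple and the parse_fpi function are names invented for this example, and identifiers without an indicator are simply labelled "ISO" here, which fits the ones in this post.

```python
from typing import NamedTuple


class FPI(NamedTuple):
    registration: str  # "+" registered, "-" unregistered, "ISO" for ISO owners
    owner: str
    text_class: str
    description: str
    language: str


def parse_fpi(fpi: str) -> FPI:
    """Split a Formal Public Identifier into its components."""
    parts = fpi.split("//")
    if parts[0] in ("+", "-"):
        if len(parts) != 4:
            raise ValueError(f"unexpected FPI structure: {fpi!r}")
        registration, owner, text, language = parts
    else:
        # No indicator: treated as an ISO owner identifier (an assumption
        # that holds for the identifiers listed in this post).
        if len(parts) != 3:
            raise ValueError(f"unexpected FPI structure: {fpi!r}")
        owner, text, language = parts
        registration = "ISO"
    # The text class is the first word; the rest is the description.
    text_class, _, description = text.partition(" ")
    return FPI(registration, owner, text_class, description, language)


print(parse_fpi("+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION Microsoft Windows bitmap//EN"))
```

For the HyTime identifiers, the owner is always the book's ISBN followed by "::Graphic Notation", and only the description changes.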

BMP
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION Microsoft Windows bitmap//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+Microsoft+Windows+bitmap:EN

GIF
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION CompuServe Graphic Interchange Format//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+CompuServe+Graphic+Interchange+Format:EN 

PCX
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION ZSoft PCX bitmap//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+ZSoft+PCX+bitmap:EN

WMF
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION Microsoft Windows Metafile//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+Microsoft+Windows+Metafile:EN

TIFF
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION Aldus/Microsoft Tagged Interchange File Format//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+Aldus%2FMicrosoft+Tagged+Interchange+File+Format:EN

PICT
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION Apple Computer Quickdraw Picture//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+Apple+Computer+Quickdraw+Picture:EN

MacPaint
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION Apple Computer Macintosh Paint//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+Apple+Computer+Macintosh+Paint:EN

GEM
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION CompuServe Graphic Interchange Format//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+CompuServe+Graphic+Interchange+Format:EN

IFF-ILBM
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION Electronic Arts embedded bitmap//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+Electronic+Arts+embedded+bitmap:EN

TARGA
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION Truevision Targa Image Format//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+Truevision+Targa+Image+Format:EN

SUNRASTER
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION Sun Microsystems raster//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+Sun+Microsystems+raster:EN

PBM
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION Jef Poskanzer Portable Bit Map//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+Jef+Poskanzer+Portable+Bit+Map:EN

XBM
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION MIT X Consortium bitmap//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+MIT+X+Consortium+bitmap:EN

XWD
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION MIT X Consortium Window Dump//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+MIT+X+Consortium+Window+Dump:EN

JPEG-TIFF
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION JPEG TIFF encapsulation//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+JPEG+TIFF+encapsulation:EN

FITS
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION International Astronomical Union Flexible Image Transfer System//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+International+Astronomical+Union+Flexible+Image+Transfer+System:EN

DXF
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION AutoDesk Drawing Interchange Format//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+AutoDesk+Drawing+Interchange+Format:EN

HP-GL
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION Hewlett Packard Graphics Language//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+Hewlett+Packard+Graphics+Language:EN

LOTUS-PIC
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION Lotus Development Corporation PIC//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+Lotus+Development+Corporation+PIC:EN

UNIX-PLOT
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION UNIX Systems Labs plot//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+UNIX+Systems+Labs+plot:EN

HP-PCL
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION Hewlett Packard Printer Control Language//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+Hewlett+Packard+Printer+Control+Language:EN

EPS
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION Adobe Systems Encapsulated PostScript//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+Adobe+Systems+Encapsulated+PostScript:EN

CGM
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION ANSI/ISO Computer Graphics Metafile//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+ANSI%2FISO+Computer+Graphics+Metafile:EN

FLI
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION Autodesk Animator FLI//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+Autodesk+Animator+FLI:EN 

SLIDE
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION Autodesk slide file//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+Autodesk+slide+file:EN

EPSON
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION Epson printer bitmap//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+Epson+printer+bitmap:EN

FACESAVER
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION Lou Katz bitmap//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+Lou+Katz+bitmap:EN

GROUP3
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION CCITT Group 3 facsimile//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+CCITT+Group+3+facsimile:EN

GROUP4
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION CCITT Group 4 facsimile//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+CCITT+Group+4+facsimile:EN

GRASP-GL
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION Microtex Ind. Graphical System for Presentation//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+Microtex+Ind.+Graphical+System+for+Presentation:EN

HP-PAINTJET
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION Hewlett Packard PaintJet//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+Hewlett+Packard+PaintJet:EN 

IGES
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION NIST Initial Graphic Exchange Standard//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+NIST+Initial+Graphic+Exchange+Standard:EN 

PDES
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION ISO Product Data Exchange Standard//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+ISO+Product+Data+Exchange+Standard:EN 

IG-TIFF
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION Island Graphics TIFF subset//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+Island+Graphics+TIFF+subset:EN 

KODAK-ICC
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION Kodak ICC high resolution printer//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+Kodak+ICC+high+resolution+printer:EN 

MSW-ICON
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION Microsoft Windows Icon//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+Microsoft+Windows+Icon:EN 

MSW-PAINT
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION Microsoft Windows Paint//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+Microsoft+Windows+Paint:EN 

MPEG
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION Motion Picture Experts Group video//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+Motion+Picture+Experts+Group+video:EN 

OFF
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION Digital Equipment Corp. OFF CAD//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+Digital+Equipment+Corp.+OFF+CAD:EN 

PC-PAINT
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION PC Paint bitmap//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+PC+Paint+bitmap:EN 

PIXAR
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION Pixar Corp. PIC bitmap//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+Pixar+Corp.+PIC+bitmap:EN 

RENDERMAN
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION Pixar Corp. RIB metafile//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+Pixar+Corp.+RIB+metafile:EN 

RIFF
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION Microsoft/IBM Reverse IFF//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+Microsoft%2FIBM+Reverse+IFF:EN

SGI
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION Silicon Graphics bitmap//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+Silicon+Graphics+bitmap:EN 

SIXEL
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION Digital Equipment Corp. color bitmap//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+Digital+Equipment+Corp.+color+bitmap:EN 

UTAH-RLE
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION University of Utah RLE color bitmap//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+University+of+Utah+RLE+color+bitmap:EN 

VICAR
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION Astronomical bitmap//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+Astronomical+bitmap:EN 

WP-GRAPHICS
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION WordPerfect graphic file//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+WordPerfect+graphic+file:EN 

XPM
+//ISBN 0-7923-9432-1::Graphic Notation//NOTATION MIT X Consortium color bitmap//EN
urn:publicid:%2B:ISBN+0-7923-9432-1;Graphic+Notation:NOTATION+MIT+X+Consortium+color+bitmap:EN

Sadly, I was not able to find any notation using the "+//IDN" prefix at the time; see the 2024 update below.

Other URIs

There are other URIs for identifying formats that are better suited to Linked Data and similar technologies than those PUBLIC identifiers. One may use general resources from DBpedia, for example (such as http://dbpedia.org/resource/Portable_Network_Graphics), but there are other databases, sometimes well hidden:

https://www.w3.org/ns/formats/

This contains identifiers for various formats related to RDF. They are defined as instances of http://www.w3.org/ns/formats/Format and also provide some useful metadata.

Update 2024

I found a notation for JavaScript that I had stashed somewhere:
+//IDN netscape.com//NOTATION JavaScript//EN
urn:publicid:%2B:IDN+netscape.com:NOTATION+JavaScript:EN
