Class MimeTypesReader

  • All Implemented Interfaces:
    MimeTypesReaderMetKeys, org.xml.sax.ContentHandler, org.xml.sax.DTDHandler, org.xml.sax.EntityResolver, org.xml.sax.ErrorHandler

    public class MimeTypesReader
    extends org.xml.sax.helpers.DefaultHandler
    implements MimeTypesReaderMetKeys
    A reader for XML files compliant with the freedesktop MIME-info DTD.
      <!DOCTYPE mime-info [
        <!ELEMENT mime-info (mime-type)+>
        <!ATTLIST mime-info xmlns CDATA #FIXED "http://www.freedesktop.org/standards/shared-mime-info">
     
        <!ELEMENT mime-type (_comment|acronym|expanded-acronym|glob|magic|root-XML|alias|sub-class-of)*>
        <!ATTLIST mime-type type CDATA #REQUIRED>
     
        <!-- a comment describing a document with the respective MIME type. Example: "WMV video" -->
        <!ELEMENT _comment (#PCDATA)>
        <!ATTLIST _comment xml:lang CDATA #IMPLIED>
     
        <!-- the unexpanded MIME type acronym. Example: "WMV" -->
        <!ELEMENT acronym (#PCDATA)>
        <!ATTLIST acronym xml:lang CDATA #IMPLIED>
     
        <!-- the expanded form of the MIME type acronym. Example: "Windows Media Video" -->
        <!ELEMENT expanded-acronym (#PCDATA)>
        <!ATTLIST expanded-acronym xml:lang CDATA #IMPLIED>
     
        <!ELEMENT glob EMPTY>
        <!ATTLIST glob pattern CDATA #REQUIRED>
        <!ATTLIST glob isregex CDATA #IMPLIED>
     
        <!ELEMENT magic (match)+>
        <!ATTLIST magic priority CDATA #IMPLIED>
     
        <!ELEMENT match (match)*>
        <!ATTLIST match offset CDATA #REQUIRED>
        <!ATTLIST match type (string|big16|big32|little16|little32|host16|host32|byte) #REQUIRED>
        <!ATTLIST match value CDATA #REQUIRED>
        <!ATTLIST match mask CDATA #IMPLIED>
     
        <!ELEMENT root-XML EMPTY>
        <!ATTLIST root-XML
              namespaceURI CDATA #REQUIRED
              localName CDATA #REQUIRED>
     
        <!ELEMENT alias EMPTY>
        <!ATTLIST alias
              type CDATA #REQUIRED>
     
        <!ELEMENT sub-class-of EMPTY>
        <!ATTLIST sub-class-of
              type CDATA #REQUIRED>
      ]>
     
    In addition to the standard fields, this class also reads two Tika-specific fields:
      - link
      - uti
    See Also:
    http://freedesktop.org/wiki/Standards_2fshared_2dmime_2dinfo_2dspec
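Example (illustrative sketch, not the MimeTypesReader implementation): the DTD above can be exercised with a plain SAX handler from the JDK. The class name MimeInfoGlobSketch, the method readGlobs, and the sample document are assumptions made for this example. The handler picks up the type attribute of each mime-type element and the pattern attribute of each nested glob element.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical helper for illustration only; it does not use or mirror Tika internals.
final class MimeInfoGlobSketch {

    // A small document shaped like the DTD above; the values are illustrative.
    static final String SAMPLE =
        "<mime-info xmlns=\"http://www.freedesktop.org/standards/shared-mime-info\">"
      + "<mime-type type=\"video/x-ms-wmv\">"
      + "<_comment>WMV video</_comment>"
      + "<glob pattern=\"*.wmv\"/>"
      + "</mime-type>"
      + "</mime-info>";

    // Returns a map from MIME type name to the glob patterns declared for it.
    static Map<String, List<String>> readGlobs(String xml) throws Exception {
        Map<String, List<String>> globs = new LinkedHashMap<>();
        DefaultHandler handler = new DefaultHandler() {
            private String currentType; // type of the enclosing <mime-type>, if any

            @Override
            public void startElement(String uri, String localName, String qName,
                                     Attributes attributes) {
                // The default factory is not namespace-aware, so match on qName.
                if ("mime-type".equals(qName)) {
                    currentType = attributes.getValue("type");
                    globs.put(currentType, new ArrayList<>());
                } else if ("glob".equals(qName) && currentType != null) {
                    globs.get(currentType).add(attributes.getValue("pattern"));
                }
            }

            @Override
            public void endElement(String uri, String localName, String qName) {
                if ("mime-type".equals(qName)) {
                    currentType = null;
                }
            }
        };
        SAXParserFactory.newInstance().newSAXParser()
            .parse(new InputSource(new StringReader(xml)), handler);
        return globs;
    }

    public static void main(String[] args) throws Exception {
        System.out.println(readGlobs(SAMPLE)); // prints {video/x-ms-wmv=[*.wmv]}
    }
}
```

The sketch ignores magic, root-XML, alias and sub-class-of elements as well as the Tika-specific link and uti fields; a complete reader would handle those in further branches of startElement.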
    • Method Detail

      • resolveEntity

        public org.xml.sax.InputSource resolveEntity(java.lang.String publicId,
                                                     java.lang.String systemId)
        Specified by:
        resolveEntity in interface org.xml.sax.EntityResolver
        Overrides:
        resolveEntity in class org.xml.sax.helpers.DefaultHandler
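Example (illustrative sketch): this page does not say what the override returns. One common reason to override resolveEntity in a DefaultHandler is to stop the parser from fetching an external DTD; the sketch below does that by returning an empty InputSource. The class name EmptyEntityResolverSketch, the method rootElementName, and the example.invalid URL are assumptions made for this example, not part of MimeTypesReader.

```java
import java.io.StringReader;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical helper for illustration only.
final class EmptyEntityResolverSketch {

    // A document whose DOCTYPE points at an external DTD that cannot be fetched.
    static final String SAMPLE =
        "<!DOCTYPE mime-info SYSTEM \"http://example.invalid/mime-info.dtd\">"
      + "<mime-info/>";

    // Parses the document and returns the qualified name of its root element.
    static String rootElementName(String xml) throws Exception {
        String[] root = new String[1];
        DefaultHandler handler = new DefaultHandler() {
            @Override
            public InputSource resolveEntity(String publicId, String systemId) {
                // Supply an empty entity so the parser never opens systemId.
                return new InputSource(new StringReader(""));
            }

            @Override
            public void startElement(String uri, String localName, String qName,
                                     Attributes attributes) {
                if (root[0] == null) {
                    root[0] = qName;
                }
            }
        };
        // parse(InputSource, DefaultHandler) also registers the handler
        // as the parser's EntityResolver.
        SAXParserFactory.newInstance().newSAXParser()
            .parse(new InputSource(new StringReader(xml)), handler);
        return root[0];
    }

    public static void main(String[] args) throws Exception {
        System.out.println(rootElementName(SAMPLE));
    }
}
```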
      • startElement

        public void startElement(java.lang.String uri,
                                 java.lang.String localName,
                                 java.lang.String qName,
                                 org.xml.sax.Attributes attributes)
                          throws org.xml.sax.SAXException
        Specified by:
        startElement in interface org.xml.sax.ContentHandler
        Overrides:
        startElement in class org.xml.sax.helpers.DefaultHandler
        Throws:
        org.xml.sax.SAXException
      • endElement

        public void endElement(java.lang.String uri,
                               java.lang.String localName,
                               java.lang.String qName)
        Specified by:
        endElement in interface org.xml.sax.ContentHandler
        Overrides:
        endElement in class org.xml.sax.helpers.DefaultHandler
      • characters

        public void characters(char[] ch,
                               int start,
                               int length)
        Specified by:
        characters in interface org.xml.sax.ContentHandler
        Overrides:
        characters in class org.xml.sax.helpers.DefaultHandler
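Example (illustrative sketch): a SAX parser may deliver the text of a single element through several characters calls, so handlers usually buffer the text between startElement and endElement. The sketch below shows that pattern for the _comment element of the DTD. The class name CommentTextSketch and the method readComment are assumptions made for this example; the code does not reproduce how MimeTypesReader stores the text.

```java
import java.io.StringReader;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical helper for illustration only.
final class CommentTextSketch {

    // Returns the trimmed text of the first <_comment> element, or "" if none exists.
    static String readComment(String xml) throws Exception {
        StringBuilder result = new StringBuilder();
        DefaultHandler handler = new DefaultHandler() {
            private StringBuilder text;   // non-null only while inside <_comment>
            private boolean done;         // true once the first comment is captured

            @Override
            public void startElement(String uri, String localName, String qName,
                                     Attributes attributes) {
                if (!done && "_comment".equals(qName)) {
                    text = new StringBuilder();
                }
            }

            @Override
            public void characters(char[] ch, int start, int length) {
                // May be called more than once per text node, so append each chunk.
                if (text != null) {
                    text.append(ch, start, length);
                }
            }

            @Override
            public void endElement(String uri, String localName, String qName) {
                if (text != null && "_comment".equals(qName)) {
                    result.append(text.toString().trim());
                    text = null;
                    done = true;
                }
            }
        };
        SAXParserFactory.newInstance().newSAXParser()
            .parse(new InputSource(new StringReader(xml)), handler);
        return result.toString();
    }

    public static void main(String[] args) throws Exception {
        String xml = "<mime-type type=\"video/x-ms-wmv\">"
                   + "<_comment>WMV &amp; video</_comment></mime-type>";
        System.out.println(readComment(xml)); // prints WMV & video
    }
}
```

The entity reference in the sample is there because parsers often split the text around it into separate characters calls, which is the case the buffer guards against.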
      • setPoolSize

        public static void setPoolSize(int poolSize)
                                throws TikaException
        Set the pool size for cached XML parsers.
        Parameters:
        poolSize - the number of XML parsers to keep in the pool
        Throws:
        TikaException