Class MediaTypeRegistry

  • All Implemented Interfaces:
    java.io.Serializable

    public class MediaTypeRegistry
    extends java.lang.Object
    implements java.io.Serializable
    Registry of known Internet media types.
    See Also:
    Serialized Form
    • Constructor Detail

      • MediaTypeRegistry

        public MediaTypeRegistry()
    • Method Detail

      • getDefaultRegistry

        public static MediaTypeRegistry getDefaultRegistry()
        Returns the built-in media type registry included in Tika.
        Returns:
        default media type registry
        Since:
        Apache Tika 0.8
      • getTypes

        public java.util.SortedSet<MediaType> getTypes()
        Returns the set of all known canonical media types. Type aliases are not included in the returned set.
        Returns:
        canonical media types
        Since:
        Apache Tika 0.8
      • getAliases

        public java.util.SortedSet<MediaType> getAliases​(MediaType type)
        Returns the set of known aliases of the given canonical media type.
        Parameters:
        type - canonical media type
        Returns:
        known aliases
        Since:
        Apache Tika 0.8
      • getChildTypes

        public java.util.SortedSet<MediaType> getChildTypes​(MediaType type)
        Returns the set of known children of the given canonical media type
        Parameters:
        type - canonical media type
        Returns:
        known children
        Since:
        Apache Tika 1.8
      • addType

        public void addType​(MediaType type)
      • isSpecializationOf

        public boolean isSpecializationOf​(MediaType a,
                                          MediaType b)
        Checks whether the given media type a is a specialization of a more generic type b. Both types should be already normalised.
        Parameters:
        a - media type, normalised
        b - suspected supertype, normalised
        Returns:
        true if b is a supertype of a, false otherwise
        Since:
        Apache Tika 0.8
      • isInstanceOf

        public boolean isInstanceOf​(MediaType a,
                                    MediaType b)
        Checks whether the given media type equals the given base type or is a specialization of it. Both types should be already normalised.
        Parameters:
        a - media type, normalised
        b - base type, normalised
        Returns:
        true if b equals a or is a specialization of it, false otherwise
        Since:
        Apache Tika 1.2
      • isInstanceOf

        public boolean isInstanceOf​(java.lang.String a,
                                    MediaType b)
        Parses and normalises the given media type string and checks whether the result equals the given base type or is a specialization of it. The given base type should already be normalised.
        Parameters:
        a - media type
        b - base type, normalised
        Returns:
        true if b equals a or is a specialization of it, false otherwise
        Since:
        Apache Tika 1.2
      • getSupertype

        public MediaType getSupertype​(MediaType type)
        Returns the supertype of the given type. If the media type database has an explicit inheritance rule for the type, then that is used. Next, if the given type has any parameters, then the respective base type (parameter-less) is returned. Otherwise built-in heuristics like text/... -> text/plain and .../...+xml -> application/xml are used. Finally application/octet-stream is returned for all types for which no other supertype is known, and the return value for application/octet-stream is null.
        Parameters:
        type - media type
        Returns:
        supertype, or null for application/octet-stream
        Since:
        Apache Tika 0.8