Interface Office


public interface Office
Office Document properties collection. These properties apply to Office / Productivity Documents of all forms, including (but not limited to) MS Office and OpenDocument formats. This is a logical collection of properties, which may be drawn from a few different external definitions.
Since:
Apache Tika 1.2
  • Field Details

    • NAMESPACE_URI_DOC_META

      static final String NAMESPACE_URI_DOC_META
      See Also:
    • PREFIX_DOC_META

      static final String PREFIX_DOC_META
      See Also:
    • USER_DEFINED_METADATA_NAME_PREFIX

      static final String USER_DEFINED_METADATA_NAME_PREFIX
      For user defined metadata entries in the document, what prefix should be attached to the key names. eg <meta:user-defined meta:name="Info1">Text1</meta:user-defined> becomes custom:Info1=Text1
      See Also:
    • KEYWORDS

      static final Property KEYWORDS
      Keywords pertaining to a document. Also populates DublinCore.SUBJECT.
    • INITIAL_AUTHOR

      static final Property INITIAL_AUTHOR
      Name of the initial creator/author of a document
    • LAST_AUTHOR

      static final Property LAST_AUTHOR
      Name of the last (most recent) author of a document
    • AUTHOR

      static final Property AUTHOR
      Name of the principal author(s) of a document
    • CREATION_DATE

      static final Property CREATION_DATE
      When was the document created?
    • SAVE_DATE

      static final Property SAVE_DATE
      When was the document last saved?
    • SLIDE_COUNT

      static final Property SLIDE_COUNT
      The number of Slides are there in the (presentation) document
    • PAGE_COUNT

      static final Property PAGE_COUNT
      The number of Pages are there in the (paged) document
    • PARAGRAPH_COUNT

      static final Property PARAGRAPH_COUNT
      The number of individual Paragraphs in the document
    • LINE_COUNT

      static final Property LINE_COUNT
      The number of lines in the document
    • WORD_COUNT

      static final Property WORD_COUNT
      The number of Words in the document
    • CHARACTER_COUNT

      static final Property CHARACTER_COUNT
      The number of Characters in the document
    • CHARACTER_COUNT_WITH_SPACES

      static final Property CHARACTER_COUNT_WITH_SPACES
      The number of Characters in the document, including spaces
    • TABLE_COUNT

      static final Property TABLE_COUNT
      The number of Tables in the document
    • IMAGE_COUNT

      static final Property IMAGE_COUNT
      The number of Images in the document
    • OBJECT_COUNT

      static final Property OBJECT_COUNT
      The number of Objects in the document. These are typically non-Image resources embedded in the document, such as other documents or non-Image media.
    • MAPI_MESSAGE_CLASS

      static final Property MAPI_MESSAGE_CLASS
      MAPI message class. What type of .msg/MAPI file is it?
    • MAPI_SENT_BY_SERVER_TYPE

      static final Property MAPI_SENT_BY_SERVER_TYPE
    • MAPI_FROM_REPRESENTING_NAME

      static final Property MAPI_FROM_REPRESENTING_NAME
    • MAPI_FROM_REPRESENTING_EMAIL

      static final Property MAPI_FROM_REPRESENTING_EMAIL
    • MAPI_MESSAGE_CLIENT_SUBMIT_TIME

      static final Property MAPI_MESSAGE_CLIENT_SUBMIT_TIME
    • PROG_ID

      static final Property PROG_ID
      Embedded files may have a "progID" associated with them, such as Word.Document.12 or AcroExch.Document.DC
    • OCX_NAME

      static final Property OCX_NAME