Package com.day.text

Class TextUtils


  • public class TextUtils
    extends java.lang.Object
    Simple text manipulation tasks - these aren't optimized, and are intended primarily to be used by test code. (eg. to compare returned values from functions with the values expected).
    • Constructor Summary

      Constructors 
      Constructor Description
      TextUtils()  
    • Method Summary

      All Methods Static Methods Instance Methods Concrete Methods 
      Modifier and Type Method Description
      static int countOccurences​(java.lang.String big, java.lang.String small)  
      static boolean isTextSimilar​(java.lang.String docOne, java.lang.String docTwo)
      Compare two strings, and determine whether they are approximately "similar" - by this we mean whether they have the same numbers of each character in (barring whitespace).
      static boolean noisyIsTextSimilar​(java.lang.String docOne, java.lang.String docTwo)  
      static boolean occursExactlyNTimes​(java.lang.String big, java.lang.String small, int n)  
      static boolean occursOnceOnly​(java.lang.String big, java.lang.String small)  
      static java.lang.String removeControlCharacters​(java.lang.String characters)  
      static java.lang.String removeWhitespace​(java.lang.String characters)  
      static java.lang.String replace​(java.lang.String allOfTheText, java.lang.String toFind, java.lang.String toReplace)
      Search and replace
      static java.lang.String sanitizeForCommunique​(java.lang.String s)
      These are characters not allowed in a text string in a CQ any file - they require escaping.
      java.lang.String stripWhitespace​(java.lang.String s)
      Remove all whitespace from a string - most useful for test purposes (where you compare two strings, but whitespace isn't syntactically significant)
      static java.lang.String trimLastSlashLevel​(java.lang.String input)
      Removes last level from a "slashy string" representaion of a hierarchical path.
      static java.lang.String trimLeadingSlashes​(java.lang.String input)  
      static java.lang.String trimTrailingSlashes​(java.lang.String input)  
      • Methods inherited from class java.lang.Object

        equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
    • Constructor Detail

      • TextUtils

        public TextUtils()
    • Method Detail

      • removeControlCharacters

        public static java.lang.String removeControlCharacters​(java.lang.String characters)
      • removeWhitespace

        public static java.lang.String removeWhitespace​(java.lang.String characters)
      • isTextSimilar

        public static boolean isTextSimilar​(java.lang.String docOne,
                                            java.lang.String docTwo)
        Compare two strings, and determine whether they are approximately "similar" - by this we mean whether they have the same numbers of each character in (barring whitespace). We do this by sorting the strings as a character array and stripping whitespace.

        This will generate false positives on some text - for example anagrams.

        For example, the text "Watch the carthorse" and "Watch the orchestra" will both be converted to "aacceehhhorrstttw" and compared - so they will be identical.

        This method is mostly useful as a simple way of comparing data which can be differently ordered but still functionally equivalent - for example DTDs.

      • noisyIsTextSimilar

        public static boolean noisyIsTextSimilar​(java.lang.String docOne,
                                                 java.lang.String docTwo)
      • stripWhitespace

        public java.lang.String stripWhitespace​(java.lang.String s)
        Remove all whitespace from a string - most useful for test purposes (where you compare two strings, but whitespace isn't syntactically significant)
      • sanitizeForCommunique

        public static java.lang.String sanitizeForCommunique​(java.lang.String s)
        These are characters not allowed in a text string in a CQ any file - they require escaping.
      • replace

        public static java.lang.String replace​(java.lang.String allOfTheText,
                                               java.lang.String toFind,
                                               java.lang.String toReplace)
        Search and replace
      • trimTrailingSlashes

        public static java.lang.String trimTrailingSlashes​(java.lang.String input)
      • trimLeadingSlashes

        public static java.lang.String trimLeadingSlashes​(java.lang.String input)
      • occursOnceOnly

        public static boolean occursOnceOnly​(java.lang.String big,
                                             java.lang.String small)
      • occursExactlyNTimes

        public static boolean occursExactlyNTimes​(java.lang.String big,
                                                  java.lang.String small,
                                                  int n)
      • countOccurences

        public static int countOccurences​(java.lang.String big,
                                          java.lang.String small)
      • trimLastSlashLevel

        public static java.lang.String trimLastSlashLevel​(java.lang.String input)
        Removes last level from a "slashy string" representaion of a hierarchical path. For example, '/a/b/c' becomes '/a/b' with the exception that '/' and '' remain unchanged