Class WebLogAnalysis.FilterDocByKeyWords

  • All Implemented Interfaces:
    Serializable, org.apache.flink.api.common.functions.FilterFunction<org.apache.flink.api.java.tuple.Tuple2<String,​String>>, org.apache.flink.api.common.functions.Function
    Enclosing class:
    WebLogAnalysis

    public static class WebLogAnalysis.FilterDocByKeyWords
    extends Object
    implements org.apache.flink.api.common.functions.FilterFunction<org.apache.flink.api.java.tuple.Tuple2<String,​String>>
    MapFunction that filters for documents that contain a certain set of keywords.
    See Also:
    Serialized Form
    • Constructor Detail

      • FilterDocByKeyWords

        public FilterDocByKeyWords()
    • Method Detail

      • filter

        public boolean filter​(org.apache.flink.api.java.tuple.Tuple2<String,​String> value)
                       throws Exception
        Filters for documents that contain all of the given keywords and projects the records on the URL field.

        Output Format: 0: URL 1: DOCUMENT_TEXT

        Specified by:
        filter in interface org.apache.flink.api.common.functions.FilterFunction<org.apache.flink.api.java.tuple.Tuple2<String,​String>>
        Throws:
        Exception