Class AbstractAsyncStateStreamOperator<OUT>
- java.lang.Object
-
- org.apache.flink.streaming.api.operators.AbstractStreamOperator<OUT>
-
- org.apache.flink.runtime.asyncprocessing.operators.AbstractAsyncStateStreamOperator<OUT>
-
- All Implemented Interfaces:
Serializable,org.apache.flink.api.common.state.CheckpointListener,KeyContext,KeyContextHandler,StreamOperator<OUT>,StreamOperatorStateHandler.CheckpointedStreamOperator,YieldingOperator<OUT>,AsyncStateProcessing,AsyncStateProcessingOperator
- Direct Known Subclasses:
AbstractAsyncStateUdfStreamOperator
@Internal public abstract class AbstractAsyncStateStreamOperator<OUT> extends AbstractStreamOperator<OUT> implements AsyncStateProcessingOperator
This operator is an abstract class that give theAbstractStreamOperatorthe ability to performAsyncStateProcessing. The aim is to make any subclass ofAbstractStreamOperatorcould manipulate async state with only a change of base class.- See Also:
- Serialized Form
-
-
Field Summary
Fields Modifier and Type Field Description protected DeclarationManagerdeclarationManager-
Fields inherited from class org.apache.flink.streaming.api.operators.AbstractStreamOperator
combinedWatermark, config, lastRecordAttributes1, lastRecordAttributes2, latencyStats, metrics, output, processingTimeService, stateHandler, stateKeySelector1, stateKeySelector2, timeServiceManager
-
-
Constructor Summary
Constructors Constructor Description AbstractAsyncStateStreamOperator()
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description <K> voidasyncProcessWithKey(K key, org.apache.flink.util.function.ThrowingRunnable<Exception> processing)Asynchronously process a code with a key provided.voidbeforeInitializeStateHandler()Initialize necessary state components forAbstractStreamOperator.voidclose()This method is called at the very end of the operator's life, both in the case of a successful completion of the operation, and in the case of a failure and canceling.voiddrainStateRequests()voidfinish()This method is called at the end of data processing.<K> AsyncKeyedStateBackend<K>getAsyncKeyedStateBackend()ObjectgetCurrentKey()DeclarationManagergetDeclarationManager()Get the declaration manager for user-logic declaring.ElementOrdergetElementOrder()Get theElementOrderof this operator.<K,N>
InternalTimerService<N>getInternalTimerService(String name, org.apache.flink.api.common.typeutils.TypeSerializer<N> namespaceSerializer, Triggerable<K,N> triggerable)Returns aInternalTimerServicethat can be used to query current processing time and event time and to set timers.<N,S extends org.apache.flink.api.common.state.v2.State,T>
SgetOrCreateKeyedState(N defaultNamespace, org.apache.flink.api.common.typeutils.TypeSerializer<N> namespaceSerializer, org.apache.flink.api.common.state.v2.StateDescriptor<T> stateDescriptor)Create new state (v2) based on new state descriptor.<T> org.apache.flink.util.function.ThrowingConsumer<StreamRecord<T>,Exception>getRecordProcessor(int inputId)Get the record processor that could process record from input, which is the only entry for async processing.booleanisAsyncStateProcessingEnabled()Indicates whether this operator is enabling the async state.voidnewKeySelected(Object newKey)A hook that will be invoked after a new key is selected.voidpostProcessElement()A callback that will be triggered after an element finishesprocessElement.voidpostProcessWatermark(Watermark watermark)A hook that will be invoked after finishing advancing the watermark.voidprepareSnapshotPreBarrier(long checkpointId)This method is called when the operator should do a snapshot, before it emits its own checkpoint barrier.WatermarkpreProcessWatermark(Watermark watermark)A hook that will be triggered when receiving a watermark.voidpreserveRecordOrderAndProcess(org.apache.flink.util.function.ThrowingRunnable<Exception> processing)Check the order of same-key record, and then process the record.voidprocessRecordAttributes(RecordAttributes recordAttributes)voidprocessRecordAttributes1(RecordAttributes recordAttributes)voidprocessRecordAttributes2(RecordAttributes recordAttributes)voidprocessWatermark(WatermarkEvent watermark)voidprocessWatermark(Watermark mark)Process a watermark when receiving it.voidprocessWatermark1(WatermarkEvent watermark)voidprocessWatermark1Internal(WatermarkEvent watermark)voidprocessWatermark2(WatermarkEvent watermark)voidprocessWatermark2Internal(WatermarkEvent watermark)voidprocessWatermarkInternal(WatermarkEvent watermark)voidprocessWatermarkStatus(WatermarkStatus watermarkStatus)protected voidprocessWatermarkStatus(WatermarkStatus watermarkStatus, int index)protected voidreportOrForwardLatencyMarker(LatencyMarker marker)<T> voidsetAsyncKeyedContextElement(StreamRecord<T> record, org.apache.flink.api.java.functions.KeySelector<T,?> keySelector)Set key context for async state processing.voidsetKeyContextElement1(StreamRecord record)voidsetKeyContextElement2(StreamRecord record)-
Methods inherited from class org.apache.flink.streaming.api.operators.AbstractStreamOperator
getContainingTask, getExecutionConfig, getKeyedStateBackend, getKeyedStateStore, getMetricGroup, getOperatorConfig, getOperatorID, getOperatorName, getOperatorStateBackend, getOrCreateKeyedState, getPartitionedState, getPartitionedState, getProcessingTimeService, getRuntimeContext, getStateKeySelector1, getStateKeySelector2, getTimeServiceManager, getUserCodeClassloader, hasKeyContext1, hasKeyContext2, initializeState, initializeState, isUsingCustomRawKeyedState, notifyCheckpointAborted, notifyCheckpointComplete, open, processLatencyMarker, processLatencyMarker1, processLatencyMarker2, processWatermark1, processWatermark2, processWatermarkStatus1, processWatermarkStatus2, setCurrentKey, setMailboxExecutor, setProcessingTimeService, setup, snapshotState, snapshotState, useSplittableTimers
-
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
-
Methods inherited from interface org.apache.flink.streaming.api.operators.KeyContextHandler
hasKeyContext
-
Methods inherited from interface org.apache.flink.streaming.api.operators.StreamOperator
getOperatorAttributes
-
-
-
-
Field Detail
-
declarationManager
protected DeclarationManager declarationManager
-
-
Method Detail
-
beforeInitializeStateHandler
public final void beforeInitializeStateHandler()
Initialize necessary state components forAbstractStreamOperator.- Overrides:
beforeInitializeStateHandlerin classAbstractStreamOperator<OUT>
-
isAsyncStateProcessingEnabled
public boolean isAsyncStateProcessingEnabled()
Description copied from class:AbstractStreamOperatorIndicates whether this operator is enabling the async state. Can be overridden by subclasses.- Specified by:
isAsyncStateProcessingEnabledin interfaceAsyncStateProcessing- Overrides:
isAsyncStateProcessingEnabledin classAbstractStreamOperator<OUT>- Returns:
- ture if async state processing is enabled.
-
getElementOrder
public ElementOrder getElementOrder()
Description copied from interface:AsyncStateProcessingOperatorGet theElementOrderof this operator.- Specified by:
getElementOrderin interfaceAsyncStateProcessingOperator
-
setAsyncKeyedContextElement
public final <T> void setAsyncKeyedContextElement(StreamRecord<T> record, org.apache.flink.api.java.functions.KeySelector<T,?> keySelector) throws Exception
Description copied from interface:AsyncStateProcessingOperatorSet key context for async state processing.- Specified by:
setAsyncKeyedContextElementin interfaceAsyncStateProcessingOperator- Type Parameters:
T- the type of the record.- Parameters:
record- the record.keySelector- the key selector to select a key from record.- Throws:
Exception
-
newKeySelected
public void newKeySelected(Object newKey)
A hook that will be invoked after a new key is selected. It is not recommended to perform async state here. Only some synchronous logic is suggested.- Parameters:
newKey- the new key selected.
-
postProcessElement
public final void postProcessElement()
Description copied from interface:AsyncStateProcessingOperatorA callback that will be triggered after an element finishesprocessElement.- Specified by:
postProcessElementin interfaceAsyncStateProcessingOperator
-
preserveRecordOrderAndProcess
public final void preserveRecordOrderAndProcess(org.apache.flink.util.function.ThrowingRunnable<Exception> processing)
Description copied from interface:AsyncStateProcessingOperatorCheck the order of same-key record, and then process the record. Mainly used when theAsyncStateProcessingOperator.getElementOrder()returnsElementOrder.RECORD_ORDER.- Specified by:
preserveRecordOrderAndProcessin interfaceAsyncStateProcessingOperator- Parameters:
processing- the record processing logic.
-
asyncProcessWithKey
public <K> void asyncProcessWithKey(K key, org.apache.flink.util.function.ThrowingRunnable<Exception> processing)Description copied from interface:AsyncStateProcessingOperatorAsynchronously process a code with a key provided.- Specified by:
asyncProcessWithKeyin interfaceAsyncStateProcessingOperator- Type Parameters:
K- the type of key.- Parameters:
key- the specified key.processing- the process logic.
-
getDeclarationManager
public final DeclarationManager getDeclarationManager()
Description copied from interface:AsyncStateProcessingOperatorGet the declaration manager for user-logic declaring.- Specified by:
getDeclarationManagerin interfaceAsyncStateProcessingOperator
-
getRecordProcessor
public final <T> org.apache.flink.util.function.ThrowingConsumer<StreamRecord<T>,Exception> getRecordProcessor(int inputId)
Description copied from interface:AsyncStateProcessingGet the record processor that could process record from input, which is the only entry for async processing.- Specified by:
getRecordProcessorin interfaceAsyncStateProcessing- Parameters:
inputId- the input identifier, start from 1. Borrow the design fromorg.apache.flink.streaming.api.operators.AbstractInput#inputId. This is only relevant if there is multiple inputs for the instance.
-
getOrCreateKeyedState
public <N,S extends org.apache.flink.api.common.state.v2.State,T> S getOrCreateKeyedState(@Nonnull N defaultNamespace, @Nonnull org.apache.flink.api.common.typeutils.TypeSerializer<N> namespaceSerializer, @Nonnull org.apache.flink.api.common.state.v2.StateDescriptor<T> stateDescriptor) throws ExceptionCreate new state (v2) based on new state descriptor.- Throws:
Exception
-
prepareSnapshotPreBarrier
public void prepareSnapshotPreBarrier(long checkpointId) throws ExceptionDescription copied from interface:StreamOperatorThis method is called when the operator should do a snapshot, before it emits its own checkpoint barrier.This method is intended not for any actual state persistence, but only for emitting some data before emitting the checkpoint barrier. Operators that maintain some small transient state that is inefficient to checkpoint (especially when it would need to be checkpointed in a re-scalable way) but can simply be sent downstream before the checkpoint. An example are opportunistic pre-aggregation operators, which have small the pre-aggregation state that is frequently flushed downstream.
Important: This method should not be used for any actual state snapshot logic, because it will inherently be within the synchronous part of the operator's checkpoint. If heavy work is done within this method, it will affect latency and downstream checkpoint alignments.
- Specified by:
prepareSnapshotPreBarrierin interfaceStreamOperator<OUT>- Overrides:
prepareSnapshotPreBarrierin classAbstractStreamOperator<OUT>- Parameters:
checkpointId- The ID of the checkpoint.- Throws:
Exception- Throwing an exception here causes the operator to fail and go into recovery.
-
getInternalTimerService
public <K,N> InternalTimerService<N> getInternalTimerService(String name, org.apache.flink.api.common.typeutils.TypeSerializer<N> namespaceSerializer, Triggerable<K,N> triggerable)
Returns aInternalTimerServicethat can be used to query current processing time and event time and to set timers. An operator can have several timer services, where each has its own namespace serializer. Timer services are differentiated by the string key that is given when requesting them, if you call this method with the same key multiple times you will get the same timer service instance in subsequent requests.Timers are always scoped to a key, the currently active key of a keyed stream operation. When a timer fires, this key will also be set as the currently active key.
Each timer has attached metadata, the namespace. Different timer services can have a different namespace type. If you don't need namespace differentiation you can use
VoidNamespaceSerializeras the namespace serializer.- Overrides:
getInternalTimerServicein classAbstractStreamOperator<OUT>- Type Parameters:
N- The type of the timer namespace.- Parameters:
name- The name of the requested timer service. If no service exists under the given name a new one will be created and returned.namespaceSerializer-TypeSerializerfor the timer namespace.triggerable- TheTriggerablethat should be invoked when timers fire
-
setKeyContextElement1
public void setKeyContextElement1(StreamRecord record) throws Exception
- Specified by:
setKeyContextElement1in interfaceStreamOperator<OUT>- Overrides:
setKeyContextElement1in classAbstractStreamOperator<OUT>- Throws:
Exception
-
setKeyContextElement2
public void setKeyContextElement2(StreamRecord record) throws Exception
- Specified by:
setKeyContextElement2in interfaceStreamOperator<OUT>- Overrides:
setKeyContextElement2in classAbstractStreamOperator<OUT>- Throws:
Exception
-
getCurrentKey
public Object getCurrentKey()
- Specified by:
getCurrentKeyin interfaceKeyContext- Overrides:
getCurrentKeyin classAbstractStreamOperator<OUT>
-
reportOrForwardLatencyMarker
protected void reportOrForwardLatencyMarker(LatencyMarker marker)
- Overrides:
reportOrForwardLatencyMarkerin classAbstractStreamOperator<OUT>
-
preProcessWatermark
public Watermark preProcessWatermark(Watermark watermark) throws Exception
A hook that will be triggered when receiving a watermark. Some async state can safely go within this method. Return the watermark that should be normally processed.- Parameters:
watermark- the receiving watermark.- Returns:
- the watermark that should be processed. Null if there is no need for following processing.
- Throws:
Exception
-
postProcessWatermark
public void postProcessWatermark(Watermark watermark) throws Exception
A hook that will be invoked after finishing advancing the watermark. It is not recommended to perform async state here. Only some synchronous logic is suggested.- Parameters:
watermark- the advanced watermark.- Throws:
Exception
-
processWatermark
public final void processWatermark(Watermark mark) throws Exception
Process a watermark when receiving it. Do not override this method since the async processing is difficult to write. Please override the hooks, seepreProcessWatermark(Watermark)andpostProcessWatermark(Watermark). The basic logic of processWatermark with hooks in sync form would be:Watermark watermark = preProcessWatermark(mark); if (watermark != null) { super.processWatermark(watermark); postProcessWatermark(watermark); }- Overrides:
processWatermarkin classAbstractStreamOperator<OUT>- Throws:
Exception
-
processWatermarkStatus
public void processWatermarkStatus(WatermarkStatus watermarkStatus) throws Exception
- Overrides:
processWatermarkStatusin classAbstractStreamOperator<OUT>- Throws:
Exception
-
processWatermarkStatus
protected void processWatermarkStatus(WatermarkStatus watermarkStatus, int index) throws Exception
- Overrides:
processWatermarkStatusin classAbstractStreamOperator<OUT>- Throws:
Exception
-
processRecordAttributes
@Experimental public void processRecordAttributes(RecordAttributes recordAttributes) throws Exception
- Overrides:
processRecordAttributesin classAbstractStreamOperator<OUT>- Throws:
Exception
-
processRecordAttributes1
@Experimental public void processRecordAttributes1(RecordAttributes recordAttributes)
- Overrides:
processRecordAttributes1in classAbstractStreamOperator<OUT>
-
processRecordAttributes2
@Experimental public void processRecordAttributes2(RecordAttributes recordAttributes)
- Overrides:
processRecordAttributes2in classAbstractStreamOperator<OUT>
-
processWatermarkInternal
public void processWatermarkInternal(WatermarkEvent watermark) throws Exception
- Throws:
Exception
-
processWatermark1Internal
public void processWatermark1Internal(WatermarkEvent watermark) throws Exception
- Throws:
Exception
-
processWatermark2Internal
public void processWatermark2Internal(WatermarkEvent watermark) throws Exception
- Throws:
Exception
-
processWatermark
public void processWatermark(WatermarkEvent watermark) throws Exception
- Overrides:
processWatermarkin classAbstractStreamOperator<OUT>- Throws:
Exception
-
processWatermark1
public void processWatermark1(WatermarkEvent watermark) throws Exception
- Overrides:
processWatermark1in classAbstractStreamOperator<OUT>- Throws:
Exception
-
processWatermark2
public void processWatermark2(WatermarkEvent watermark) throws Exception
- Overrides:
processWatermark2in classAbstractStreamOperator<OUT>- Throws:
Exception
-
getAsyncKeyedStateBackend
public <K> AsyncKeyedStateBackend<K> getAsyncKeyedStateBackend()
-
drainStateRequests
public void drainStateRequests()
-
finish
public void finish() throws ExceptionDescription copied from interface:StreamOperatorThis method is called at the end of data processing.The method is expected to flush all remaining buffered data. Exceptions during this flushing of buffered data should be propagated, in order to cause the operation to be recognized as failed, because the last data items are not processed properly.
After this method is called, no more records can be produced for the downstream operators.
WARNING: It is not safe to use this method to commit any transactions or other side effects! You can use this method to flush any buffered data that can later on be committed e.g. in a
CheckpointListener.notifyCheckpointComplete(long).NOTE:This method does not need to close any resources. You should release external resources in the
StreamOperator.close()method.- Specified by:
finishin interfaceStreamOperator<OUT>- Overrides:
finishin classAbstractStreamOperator<OUT>- Throws:
Exception- An exception in this method causes the operator to fail.
-
close
public void close() throws ExceptionDescription copied from interface:StreamOperatorThis method is called at the very end of the operator's life, both in the case of a successful completion of the operation, and in the case of a failure and canceling.This method is expected to make a thorough effort to release all resources that the operator has acquired.
NOTE:It can not emit any records! If you need to emit records at the end of processing, do so in the
StreamOperator.finish()method.- Specified by:
closein interfaceStreamOperator<OUT>- Overrides:
closein classAbstractStreamOperator<OUT>- Throws:
Exception
-
-