FindMatches class
Identifies matching records in the input DynamicFrame and creates a new DynamicFrame with a unique identifier assigned to each group of matching records.
To import:
from awsglueml.transforms import FindMatches
Methods
apply(frame, transformId, transformation_ctx = "", info = "", stageThreshold = 0, totalThreshold = 0, enforcedMatches = none, computeMatchConfidenceScores = 0)
Identifies matching records in the input DynamicFrame and creates a new DynamicFrame with a unique identifier assigned to each group of matching records.
frame– TheDynamicFrameto apply the FindMatches transform. Required.transformId– The unique ID associated with the FindMatches transform to apply on records in theDynamicFrame. Required.transformation_ctx– A unique string that is used to identify stats/state information. Optional.info– A string to be associated with errors in the transformation. Optional.stageThreshold– The maximum number of errors that can occur in the transformation before it errors out. Optional. The default is zero.totalThreshold– The maximum number of errors that can occur overall before processing errors out. Optional. The default is zero.enforcedMatches– TheDynamicFrameused to enforce matches. Optional. The default is None.computeMatchConfidenceScores– A Boolean value indicating whether to compute a confidence score for each group of matching records. Optional. The default is false.
Returns a new DynamicFrame with a unique identifier assigned to each group of matching records.