I have a couple columns of data; Project and ProductLineRank. In a document, the project will be repeated several times and the ProductLineRank will be the rank assigned to that product line. The dataset represents a list of products sold by project and the associated ranks. Like so:
I push these results into a data process step where I'm trying to combine the documents and calculate the minimum value (highest rank) such that each project only has one line. In this case, I would want the output to be:
Has anyone had success using aggregate functions using groovy in a custom scripting step within a data process step? That's how I'm trying to handle it but I'm unfamiliar with scripting. Or is my approach totally wrong?