Magic ETL | Remove Duplicates Tile | Select all fields option

It would be good to have a "Select All Columns" option for the remove duplicates tile in magic

Domo Arigato!

**Say 'Thanks' by clicking the thumbs up in the post that helped you.
**Please mark the post that solves your problem as 'Accepted Solution'
4
4 votes

Active · Last Updated

Comments

  • jaeW_at_Onyx
    jaeW_at_Onyx Budapest / Portland, OR 🔴

    @Godiepi that's a pretty obscure use case that you'd expect full-on row duplication.

    you could use MD5() hashing to concatenate all the values together and then use a RowNumber + rank_window tile then filter where rownumber = 1

    this is a classic way of handling identifying slowly changing dimension rows.

    Jae Wilson
    Check out my 🎥 Domo Training YouTube Channel 👨‍💻

    **Say "Thanks" by clicking the ❤️ in the post that helped you.
    **Please mark the post that solves your problem by clicking on "Accept as Solution"
  • I'd love to see a "select all" option in the remove duplicates tile, sort of like the same button on Select Columns where it loads up all the fields but then you can go back still and remove the ones you don't need or care about.

    I see plenty of cases where there's a junky input (wonky APIs, spreadsheets uploaded directly from hell with way to many columns, etc.) that needs a quick/easy button to cleanup/massage things before starting to explore the rows.

    The downside is it could easily pick up a null column and with existing behavior, return a zero row output. Bonus feature if the dedupe could skip null columns from inspection.