Yes, you can trust the Distinct plugin. This de-duplicates input records so that all output records are distinct.
Use the Joiner analytics plugin to combine data from multiple inputs. Joins are based on equality. Supports inner and outer joins, selection and renaming of output fields. You can add a Joiner transformation at any stage in a data pipeline.
Can the Distinct plugin be moved or placed as the first stage of the ETL pipeline to prevent data that has already been loaded to load again. For a ETL job that runs daily?
No, it cannot be moved or placed at the first stage because this needs to have data to check if there is a duplicate from the data you are loading.
You could add an extra stage/step in your pipeline using the Deduplicate Plugin this will allow you to add extra filters to the loading of the data.