I'm running a very large PostgreSQL database. At some point in the past a developer created a duplicate removal script for me. Problem is I have no documentation telling me what it does. I only have a link from where I can run this scriopt. I haven't needed to use it in a very long time but recently uploaded new records to the database. I started the script running and removed what it determined to be duplicate records.
The problem is I don't really know what logic it is using to determine if something is a duplicate. Fields were added to the database after the development of this script. Likely it does not account for these in determining what is a duplicate. I'm concerned running this will remove records that are not duplicates but have some fields in common. As I mentioned I started the script and after it ran for many hours I stopped it and checked the count. Many records were removed but I'm not sure what records. I do know there are duplicates in the database so likely it did fine.
The job is only for someone to locate the script on the server and write something explaining what this script is doing. I'm not asking to have it modified as that will be a seperate job. For now I just would like to know exactly what it is set to do. As stated, I have the link to run it but don't see the script. You will need to find it.
Thank you for your consideration.