top of page
Delete Duplicate Emails
LeetCode Problem 196
Write a solution to delete all duplicate emails, keeping only one unique email with the smallest id.
For SQL users, please note that you are supposed to write a DELETE statement and not a SELECT one.
For Pandas users, please note that you are supposed to modify Person in place.
After running your script, the answer shown is the Person table. The driver will first compile and run your piece of code and then show the Person table. The final order of the Person table does not matter.
import pandas as pd
data = [[1, 'john@example.com'], [2, 'bob@example.com'], [3, 'john@example.com']]
person = pd.DataFrame(data, columns=['id', 'email']).astype({'id':'int64', 'email':'object'})
def delete_duplicate_emails(person: pd.DataFrame) -> None:
person.sort_values(by="id", ascending=True, inplace=True)
person.drop_duplicates("email", inplace=True)
bottom of page