If we were to create a simple dataset based on the given string, assuming we have more data like this: