Hi All,
I got a problem when extract date from pdf.
I use a simple flow to capture pdf to xml. but
the content auto deduct 8 hours after exported to xml. pls help.
Best regards,
Joey Tsui
Hi All,
I got a problem when extract date from pdf.
I use a simple flow to capture pdf to xml. but
the content auto deduct 8 hours after exported to xml. pls help.
Best regards,
Joey Tsui
Can you share the original PDF so I can play around with it? You can do so in private if that is better.
@joeytsui : from what I can see, your data does not include the time. Therefore, the time is considered to mean 00:00 UTC, which is expressed as 12:00AM.
Since you are 8 hours ahead of UTC, that explains why you are missing those 8 hours when you fetch the data from the database.
ic, i add dateadd(hh,8 ,data) in my sql statement to handle, thanks a lot
just create a pdf and type any date in dd-MMM-yyyy format then extract it, it will happen again.
The data is “2025-02-11T16:00:00Z”. This includes the time (16:00) and timezone (the Z suffix at the end, which is UTC).
@Sander : I don’t believe so. From the screenshot, the original data is in PDF format and only the date is printed on it, with no time. What Notepad++ displays in that screenshot is the XML result from the data mapping operation, so that’s after the process has been executed. Well at least, that’s my understanding of it.
Ah I see, got it now.