Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Save timezone #4386

Closed
wants to merge 28 commits into from
Closed

Save timezone #4386

wants to merge 28 commits into from

Conversation

dannyroberts
Copy link
Member

@czue @snopoke

I'm opening this for preliminary review. (Let me know if you think there's a better workflow than opening a PR that we don't intend to merge. I like to do this sometimes to get feedback on a direction I'm going when I'm about half-way.)

Might want to scan the diff first and then review commit by commit.

Here's my progress so far:

  • Make jsonobject support replacing default handlers Type configs jsonobject#97
  • Write an UTCDateTimeProperty that accepts any valid ISO date/datetime and translates it to UTC, and saves up to microseconds. (Lenient input, strict output.)
  • Make sure UTCDateTimeProperty saves timezone information (see commits in Save timezone utc date time #4499)
  • Make XFormInstance use the UTCDateTimeProperty exclusively (both for explicit properties and dynamic properties)
  • Mark XFormInstances processed the new way with a is_timezone_aware flag set to true
  • Make CommCareCases use the new property exclusively
  • Write script to reprocess form xml (using the changes feed or a view that iterates through all forms)
  • Write a script that reprocesses all cases with affected forms (might as well be all cases). Verify that this has the expected behavior of updating the case actions to include the timezone-modified dates
  • ...make sure all reports take the timezone change into account

@dannyroberts dannyroberts added Open for review: do not merge A work in progress awaiting QA QA in progress. Do not merge labels Sep 24, 2014
@czue
Copy link
Member

czue commented Sep 25, 2014

looks good so far to me!

@snopoke
Copy link
Contributor

snopoke commented Sep 30, 2014

looks good, just wondering why you're introducing ios8601 dependency? Can't you get the same functionality from date_util and pytz?


def wrap(self, obj):
dt = iso8601.parse_date(obj)
return dt.astimezone(iso8601.iso8601.UTC).replace(tzinfo=None)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Would it be very hard to make this preserve the original time zone so it is not lost on wrap/unwrap?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't think it'd be hard, just not sure if it's desired. We rely pretty heavily on the fact that these datetime strings sort correctly with a simple lexical sort; if you have the timezone info there the way it was originally, they won't sort correctly anymore.

There are other schemes we could invent to solve this, like

YYYY-MM-DDThh:mm:ss.uuuuuuZ+05:30

where Z means it's a UTC time and "+05:30" means that it was originally given in India time. Baking a made up datetime format into this property feels weird to me, so I opted for just switching to UTC and calling a day, as the smallest change that fixes our problem.

I'm actually very willing to be convinced of a better way that's also feasible in the short term, or be convinced that one of the options I scrapped is actually better than what I'm doing.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I handn't thought about lexical sorting, and that's a very good point.

I guess a question we should answer first is do we ever need the original timezone? Would it make some parts of the app more usable to display in the original timezone? Or are we always wanting to use- and able to determine the local timezone of the user who is viewing the data?

As data scientist I think it is ideal not to discard data, even if we don't immediately have a use for it. But sometimes practicality beats purity too. Other than needing a custom parsing function, what are the problems with the custom format you proposed above?

@dannyroberts
Copy link
Member Author

@snopoke I'm not sure and you might be correct, but I was looking for something that narrowly accepts ISO8601 date/datetime strings and nothing else, and I believe date_util fights to the death to try to accept anything that looks remotely like a date or datetime. Does that make sense?

@snopoke
Copy link
Contributor

snopoke commented Sep 30, 2014

Yea that makes sense. Does this need to be normalized at all (see not in
introduction here: http://pytz.sourceforge.net/)?

@dannyroberts
Copy link
Member Author

@snopoke I'm not sure how that problem would bite us. As long as the phone time is set correctly, we're always given timezone offset ("+05:30") not a location ("India time"), so daylight savings etc should be baked into that timezone offset already and not something we need to worry about.

Does that sound right to you, or am I missing something?

@snopoke
Copy link
Contributor

snopoke commented Oct 1, 2014

True, good point.

@dannyroberts dannyroberts mentioned this pull request Oct 8, 2014
2 tasks
@@ -127,7 +127,7 @@ def testParseWithIndices(self):
# quick test for ota restore
v2response = phone_views.xml_for_case(HttpRequest(), case.get_id, version="2.0")
expected_v2_response = """
<case case_id="foo-case-id" date_modified="2011-12-07T13:42:50Z" user_id="bar-user-id" xmlns="http://commcarehq.org/case/transaction/v2">
<case case_id="foo-case-id" date_modified="2011-12-07T18:42:50Z" user_id="bar-user-id" xmlns="http://commcarehq.org/case/transaction/v2">
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

E501 line too long (145 > 115 characters)

to keep track of whether an XFormInstance
has been migrated to process timezones correctly
after timezone changes
and transforms them to utc datetimes with time zeroed out
…tetime

(we use timezone-unaware datetimes throughout to mean UTC)
for succinctly preforming many simple input/output matching tests
while otherwise acting like a tz-naive datetime
that is implicitly in UTC

also rename ISO8601Property to UTCDateTimeProperty
- split UTCDateTime into separate corehq.ext.datetime module
- minor cleanup to UTCDateTime
- give option to pass in tz string instead of timedelta for UTCDateTime original_offset
- change UTCDateTime repr to use tz string notation for succinctness
for converting back to a normal timezone-aware datetime
when no timzone is given
datetimes are still treated as UTC, but the original_offset is given as None
and the timezone hint (+00:00) is excluded from the output format
make certain tests use exact_equals for comparison instead

also improve corpus test docstrings slightly
exposed an issue with ISODocument
and add tests

fixes commtrack tests

@staticmethod
def _validate_tz_offset(hours, minutes, seconds):
if not (0 <= hours <= 15):
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

comparison does not match error message: 0 <= hours <= 15 != between 0 and 14 (inclusive)

@snopoke
Copy link
Contributor

snopoke commented Oct 30, 2014

Test failure real

@proteusvacuum proteusvacuum deleted the save-timezone branch June 28, 2016 21:44
@snopoke snopoke restored the save-timezone branch March 19, 2017 00:06
@dannyroberts dannyroberts deleted the save-timezone branch March 29, 2019 20:02
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
awaiting QA QA in progress. Do not merge Open for review: do not merge A work in progress
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants