SMPP needs to have a pluggable codec class. #836

smn · 2014-08-08T15:21:01Z

It is possible for MNOs to give us messages that are in an encoding that aren't in the standard encodings that Python ships with.

We have two options:

Register a new codec with the codec registry.
Create a pluggable codec class for SMPP that fallbacks to python's codec registry for things it knows about but provides hooks for providing other encodings.

We've chosen to go with option 2 because adding a codec to the registry introduces all sorts of potential code loading race conditions.

smn · 2014-08-08T09:57:13Z

This would solve #338 for SMPP.

smn · 2014-08-12T09:08:50Z

@rudigiesler @justinvdm @hodgestar can I get a review?

rudigiesler · 2014-08-12T09:47:35Z

👍, but I don't know the Vumi code-base well enough to know if this might mess with something else.

justinvdm · 2014-08-12T09:58:25Z

vumi/transports/smpp/processors/default.py

We implemented a ucs2 codec in this PR that proxies to the utf-16be codec. Should we maybe refer to it here?

Yeah, my thinking was to have the map in the DeliveryShortMessageProcessor be only codecs that already exist in the standard python distribution. The configurable codec_class can override some of these if it wants, which it does with the ucs2 implementation it provides.

I'm ±0 on this really, it seemed a sensible thing to do yesterday but happy to stick in a custom codec here as well.

@hodgestar thoughts?

Ah, ok, that works.

I think sticking to the built-in codecs by default makes sense for now -- we can shift things around easily later.

justinvdm · 2014-08-12T10:24:28Z

Minor comment, otherwise looks good.

justinvdm · 2014-08-12T10:31:54Z

👍

smn · 2014-08-12T10:36:50Z

@rudigiesler thanks for the review, I don't expect you to know everything or be catching problems. The best way to learn is by reading PRs (which is why we're constantly asking you to review stuff).

hodgestar · 2014-08-12T10:47:00Z

vumi/codecs/vumi_codecs.py

Seem like performance on this would be terrible. Should we construct a reverse mapping?

Good idea, let me start on that.

hodgestar · 2014-08-12T10:50:56Z

vumi/codecs/vumi_codecs.py

hodgestar · 2014-08-12T10:53:41Z

I left a bunch of questions but they're mostly just to check my understanding of the changes.

hodgestar · 2014-08-13T11:07:27Z

I'm wondering whether we should land this and then sort out the 7-bit packing in a separate PR? We should start that PR by adding an integration test that sends in a 7-bit packed PDU and checks that we handle it correctly?

👍 on this PR landing once we have the ticket for the new one.

smn · 2014-08-14T07:45:30Z

@hodgestar GSM7Bit now returns bytestrings, I'm reasonably convinced that I saw that that is what we were receiving anyway.

Also added handling of errors kwarg somewhat properly.

hodgestar · 2014-08-14T07:55:08Z

vumi/codecs/vumi_codecs.py

Does this perhaps need to be return self.gsm_basic_charset_map.get('?')?

I guess we should have tests for the error cases?

good catch!

smn · 2014-08-14T08:24:47Z

@hodgestar ready for rereview

smn · 2014-08-14T08:54:03Z

@rudigiesler @justinvdm also again ready for re-revieww

hodgestar · 2014-08-14T08:59:13Z

vumi/codecs/vumi_codecs.py

I noticed another issue here -- we call the same error handlers for both encoding and decoding, but I don't think that makes sense:

For decoding, handle_replace_error needs to return u'?'.

For decoding and encoding, handle_strict_error should raise UnicodeDecodeError or UnicodeEncodeError as appropriate.

I had a feeling this was a bad idea to begin with.

@hodgestar

…nks @hodgestar)

smn · 2014-08-14T09:38:37Z

@hodgestar ok, again :) Not entirely happy with some of the duplication but it's not too bad.

hodgestar · 2014-08-14T09:48:57Z

vumi/codecs/tests/test_vumi_codecs.py

This should check that UnicodeEncodeError is raised (and the equivalent decode test should have a similar change).

thanks, done.

hodgestar · 2014-08-14T09:49:19Z

Other than one small comment, looking good.

…hodgestar

…hodgestar)

hodgestar · 2014-08-14T09:56:08Z

👍 as soon as a Travis build passes (looks like they're building now).

smn added 6 commits August 8, 2014 16:23

initial layout

09c06a0

mimick codec.[en|de]code behaviour with default encoding

38d0c5b

update test for default encoding option

aa640e6

module naming conflict with pythons codecs module

46eb895

UCS2 codec

1d0075c

start on GSM 03.38 codec

b73aff5

smn added in progress labels Aug 8, 2014

smn added 10 commits August 8, 2014 17:27

fix default config values to make tests pass

0f87d68

Merge branch 'develop' into feature/issue-836-smpp-encoding-class

2d6946d

hotwiring the SmppCodec

ee7a2c9

move into vumi.codecs module

7820ac8

remove the need for decode_pdus (refactor cat!)

c467c97

pluggable SMPP codec class

f6e8289

rename SmppCodec -> VumiCodec

1f1995b

catch a UnicodeDecodeError, not all Exceptions

aab861c

typo

69e639d

keep original behaviour of returning the original object to be decoded

204aba0

smn added please-review and removed in progress labels Aug 11, 2014

justinvdm reviewed Aug 12, 2014
View reviewed changes

hodgestar reviewed Aug 12, 2014
View reviewed changes

vumi/codecs/vumi_codecs.py

Copy link

Contributor

hodgestar Aug 12, 2014

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

\o/

smn added 2 commits August 14, 2014 09:39

stop hexing the gsm7bit, raw bytestrings instead (thanks @hodgestar)

feff18e

test for extended GSM set

3ff0cb2

pep8 fixes

3686e10

smn mentioned this pull request Aug 14, 2014

GSM7Bit SMPP codec should be able to fallback to UCS2. #838

Open

hodgestar reviewed Aug 14, 2014
View reviewed changes

tests for error handling

39d2498

hodgestar reviewed Aug 14, 2014
View reviewed changes

split encode & decode error handling and raise proper exceptions (tha…

e064733

…nks @hodgestar)

hodgestar reviewed Aug 14, 2014
View reviewed changes

test for UnicodeDecode/EncodeErrors instead of UnicodeError (thanks @…

7a86f39

…hodgestar)

smn merged commit 7a86f39 into develop Aug 14, 2014

SMPP needs to have a pluggable codec class. #836

SMPP needs to have a pluggable codec class. #836

Uh oh!

Conversation

smn commented Aug 8, 2014

Uh oh!

smn commented Aug 8, 2014

Uh oh!

smn commented Aug 12, 2014

Uh oh!

rudigiesler commented Aug 12, 2014

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

justinvdm commented Aug 12, 2014

Uh oh!

justinvdm commented Aug 12, 2014

Uh oh!

smn commented Aug 12, 2014

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

hodgestar commented Aug 12, 2014

Uh oh!

hodgestar commented Aug 13, 2014

Uh oh!

smn commented Aug 14, 2014

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

smn commented Aug 14, 2014

Uh oh!

smn commented Aug 14, 2014

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

smn commented Aug 14, 2014

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

hodgestar commented Aug 14, 2014

Uh oh!

hodgestar commented Aug 14, 2014

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants