Character Translation using Python (like the tr command)

PythonPerlTrTransliteration

Python Problem Overview


Is there a way to do character translation / transliteration (kind of like the tr command) using Python?

Some examples in Perl would be:

my $string = "some fields";
$string =~ tr/dies/eaid/;
print $string;  # domi failed

$string = 'the cat sat on the mat.';
$string =~ tr/a-z/b/d;
print "$string\n";  # b b   b.  (because option "d" is used to delete characters not replaced)

Python Solutions


Solution 1 - Python

See string.translate

import string
"abc".translate(string.maketrans("abc", "def")) # => "def"

Note the doc's comments about subtleties in the translation of unicode strings.

And for Python 3, you can use directly:

str.translate(str.maketrans("abc", "def"))

Edit: Since tr is a bit more advanced, also consider using re.sub.

Solution 2 - Python

If you're using python3 translate is less verbose:

>>> 'abc'.translate(str.maketrans('ac','xy'))
'xby'

Ahh.. and there is also equivalent to tr -d:

>>> "abc".translate(str.maketrans('','','b'))
'ac' 

For tr -d with python2.x use an additional argument to translate function:

>>> "abc".translate(None, 'b')
'ac'

Solution 3 - Python

I has developed python-tr, implemented tr algorithm. Let's try it.

Install:

$ pip install python-tr

Example:

>>> from tr import tr
>>> tr('bn', 'cr', 'bunny')
'curry'
>>> tr('n', '', 'bunny', 'd')
'buy'
>>> tr('n', 'u', 'bunny', 'c')
'uunnu'
>>> tr('n', '', 'bunny', 's')
'buny'
>>> tr('bn', '', 'bunny', 'cd')
'bnn'
>>> tr('bn', 'cr', 'bunny', 'cs')
'brnnr'
>>> tr('bn', 'cr', 'bunny', 'ds')
'uy'

Solution 4 - Python

In Python 2, unicode.translate() accepts ordinary mappings, ie. there's no need to import anything either:

>>> u'abc+-'.translate({ord('+'): u'-', ord('-'): u'+', ord('b'): None})
u'ac-+'

The translate() method is especially useful for swapping characters (as '+' and '-' above), which can't be done with replace(), and using re.sub() isn't very straightforward for that purpose either.

I have to admit, however, that the repeated use of ord() doesn't make the code look like nice and tidy.

Solution 5 - Python

We build a map and then translate letter by letter. When using get for dictionary then the second argument specifying what to return if not find anything.

It could be easily transferred to separate function. Mostly should be very efficient.

def transy(strin, old, new):
  assert len(old)==len(new)
  trans = dict(zip(list(old),list(new)))
  res =  "".join([trans.get(i,i) for i in strin])
  return res

>>> transy("abcd", "abc", "xyz")
'xyzd'

Solution 6 - Python

A simpler approach may be to use replace. e.g.

 "abc".replace("abc", "def")
'def'

No need to import anything. Works in Python 2.x

Attributions

All content for this solution is sourced from the original question on Stackoverflow.

The content on this page is licensed under the Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) license.

Content TypeOriginal AuthorOriginal Content on Stackoverflow
QuestionhhafezView Question on Stackoverflow
Solution 1 - PythonRichard LevasseurView Answer on Stackoverflow
Solution 2 - PythonPiotr CzaplaView Answer on Stackoverflow
Solution 3 - PythonYukino IkegamiView Answer on Stackoverflow
Solution 4 - PythonlenzView Answer on Stackoverflow
Solution 5 - PythonpolkasView Answer on Stackoverflow
Solution 6 - PythonChris HaagView Answer on Stackoverflow