peda/python23-compatibility.md

# Overview

PEDA supports Python 2 and Python 3 using the
[six](https://pypi.python.org/pypi/six) library. To make sure code runs on both
Python 2 and Python 3, make sure to keep the following in mind. 

## Division

For integer division, use the `//` operator instead of `/`. In Python 3, the `/` operator returns a `float`.

In Python 3:

```python
>>> 5 / 2
2.5
>>> type(5 / 2)
<class 'float'>
```

## Type checking

To check if something is a string:

```python
isinstance(obj, six.string_types)
```

To check if something is an integer type:

```python
isinstance(x, six.integer_types)
```

## Strings

In Python 2, `bytes` is an alias for `str`. In Python 3, `str` is a unicode
type and `bytes` is used for a sequece of arbitrary bytes. Use a leading 'b' to
signify that a string is a `bytes` object.

```python
>>> 'Normal string'
'Normal string'
>>> b'arbitrary bytes \x90\x90'
b'arbitrary bytes \x90\x90'
```

To convert between `str` to `bytes`:

```python
>>> 'hi there'.encode('utf-8')
b'hi there'
>>> b'some string'.decode('utf-8')
'some string'
```

Do not mix `bytes` and `str` with each other with basic string functions. The
following is okay:

```python

>>> "abc".replace("a", "f")
'fbc'
>>> b"abc".replace(b"a", b"f")
b'fbc'
```

Mixing types in Python 3 will throw an exception:

```python
>>> b"abc".replace("a", "f")
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
TypeError: expected bytes, bytearray or buffer compatible object

```

In Python 2, indexing into a `str` returns a `str` of length 1. In Python 3, indexing into a `bytes` returns an `int`. This causes a problem when iterating. To solve this, use the `bytes_iterator` from `utils.py`.

```python
# In Python 2:
>>> s = b'hello'
>>> s
'hello'
>>> s[0]
'h'

# In Python 3:
>>> s = b'hello'
>>> s
b'hello'
>>> s[0]
104

# Solution:
>>> for c in bytes_iterator(b'hi'): print(c)
... 
b'h'
b'i'
```

## Encodings

Encode (and decode) strings into hex:

```python
>>> codecs.encode(b'abcdef', 'hex')
b'616263646566'
>>> codecs.decode('616263646566', 'hex')
b'abcdef'
```
Add support for Python 3 using the six library. Refer to python23-compatibility.md for details about supporting both Python 2 and Python 3. 2015-07-05 12:44:36 +08:00			`# Overview`

			`PEDA supports Python 2 and Python 3 using the`
			`[six](https://pypi.python.org/pypi/six) library. To make sure code runs on both`
			`Python 2 and Python 3, make sure to keep the following in mind.`

			`## Division`

			For integer division, use the `//` operator instead of `/`. In Python 3, the `/` operator returns a `float`.

			`In Python 3:`

			```python
			`>>> 5 / 2`
			`2.5`
			`>>> type(5 / 2)`
			`<class 'float'>`
			```

			`## Type checking`

			`To check if something is a string:`

			```python
			`isinstance(obj, six.string_types)`
			```

			`To check if something is an integer type:`

			```python
			`isinstance(x, six.integer_types)`
			```

			`## Strings`

			In Python 2, `bytes` is an alias for `str`. In Python 3, `str` is a unicode
			type and `bytes` is used for a sequece of arbitrary bytes. Use a leading 'b' to
			signify that a string is a `bytes` object.

			```python
			`>>> 'Normal string'`
			`'Normal string'`
			`>>> b'arbitrary bytes \x90\x90'`
			`b'arbitrary bytes \x90\x90'`
			```

			To convert between `str` to `bytes`:

			```python
			`>>> 'hi there'.encode('utf-8')`
			`b'hi there'`
			`>>> b'some string'.decode('utf-8')`
			`'some string'`
			```

			Do not mix `bytes` and `str` with each other with basic string functions. The
			`following is okay:`

			```python

			`>>> "abc".replace("a", "f")`
			`'fbc'`
			`>>> b"abc".replace(b"a", b"f")`
			`b'fbc'`
			```

			`Mixing types in Python 3 will throw an exception:`

			```python
			`>>> b"abc".replace("a", "f")`
			`Traceback (most recent call last):`
			`File "<stdin>", line 1, in <module>`
			`TypeError: expected bytes, bytearray or buffer compatible object`

			```

			In Python 2, indexing into a `str` returns a `str` of length 1. In Python 3, indexing into a `bytes` returns an `int`. This causes a problem when iterating. To solve this, use the `bytes_iterator` from `utils.py`.

			```python
			`# In Python 2:`
			`>>> s = b'hello'`
			`>>> s`
			`'hello'`
			`>>> s[0]`
			`'h'`

			`# In Python 3:`
			`>>> s = b'hello'`
			`>>> s`
			`b'hello'`
			`>>> s[0]`
			`104`

			`# Solution:`
			`>>> for c in bytes_iterator(b'hi'): print(c)`
			`...`
			`b'h'`
			`b'i'`
			```

			`## Encodings`

			`Encode (and decode) strings into hex:`

			```python
			`>>> codecs.encode(b'abcdef', 'hex')`
			`b'616263646566'`
			`>>> codecs.decode('616263646566', 'hex')`
			`b'abcdef'`
			```