Performing string operations with chararray
NumPy has a specialized
chararray
object, which can hold strings. It is a subclass of ndarray
, and has special string methods. We will
download a text from the Python website and use those methods. The advantages of chararray
over a normal array of strings are as follows:
Whitespace of array elements is automatically trimmed on indexing
Whitespace at the ends of strings is also trimmed by comparison operators
Vectorized string operations are available, so loops are not needed
How to do it...
Let's create the character array.
Create the character array.
We can create the character array as a view:
carray = numpy.array(html).view(numpy.chararray)
Expand tabs to spaces.
Expand tabs to spaces with the
expandtabs
function. This function accepts the tab size as argument. The value is8
, if not specified:carray = carray.expandtabs(1)
Split lines.
The
splitlines
function can split a string into separate lines:carray = carray.splitlines()
The following is the complete...