Ucto is a rule-based tokeniser for multiple languages that also performs sentence boundary detection. Although Ucto itself is written in C++, the Python binding python-ucto lets you use it from Python.
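To make the idea of rule-based tokenisation with sentence boundary detection concrete, here is a minimal sketch in plain Python. It does not use Ucto and does not reproduce Ucto's rule files: the rule names, the regular expressions, and the `tokenise` helper are all hypothetical and chosen only for illustration. The assumption is simply that an ordered list of rules is tried at each position, and that certain punctuation tokens close a sentence.

```python
import re

# Hypothetical, heavily simplified rules (NOT Ucto's actual rule set).
# Order matters: earlier rules take priority over later ones.
TOKEN_RULES = [
    ("ABBREVIATION", r"(?:Mr|Mrs|Dr|etc)\."),   # keep the full stop attached
    ("NUMBER", r"\d+(?:[.,]\d+)*"),             # 3, 3.50, 1,000
    ("WORD", r"\w+(?:'\w+)?"),                  # words, including "Didn't"
    ("PUNCTUATION", r"[^\w\s]"),                # any other single symbol
]
TOKEN_PATTERN = re.compile(
    "|".join(f"(?P<{name}>{rule})" for name, rule in TOKEN_RULES)
)

# Punctuation tokens that end a sentence in this sketch.
SENTENCE_FINAL = {".", "!", "?"}


def tokenise(text):
    """Return a list of sentences, each a list of (token, rule_name) pairs."""
    sentences, current = [], []
    for match in TOKEN_PATTERN.finditer(text):
        current.append((match.group(), match.lastgroup))
        # Only a stand-alone punctuation token closes a sentence, so the
        # full stop inside an abbreviation such as "Dr." does not.
        if match.lastgroup == "PUNCTUATION" and match.group() in SENTENCE_FINAL:
            sentences.append(current)
            current = []
    if current:  # trailing tokens without sentence-final punctuation
        sentences.append(current)
    return sentences


if __name__ == "__main__":
    for sentence in tokenise("Dr. Smith paid 3.50 euros. Didn't he?"):
        print(sentence)
```

Because the abbreviation rule is tried before the punctuation rule, "Dr." stays a single token and does not trigger a sentence break. A real tokeniser such as Ucto relies on much larger, language-specific rule sets for the same kind of decision.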
import ucto
#Set a file to use as tokeniser rules, this one is for English, other languages are availabl...