Linux 0
IPA chart for English 1
International Phonetic Alphabet 2
English language 3
Anglo-Frisian languages 4
West Germanic languages 5
Germanic languages 6
Indo-European languages 7
Language family 8
Language 9
DATA = [
    [
        [['Who', 'is', 'Shaka', 'Khan', '?'], ['O', 'O', 'I-PER', 'I-PER', 'O']]
    ],
    [
        [['I', 'like', 'London', 'and', 'Berlin', '.'], ['O', 'O', 'I-LOC', 'O', 'I-LOC', 'O']]
    ]
]
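with open('output.conll', 'w') as f:
    # The original snippet is cut off here. The body below is an assumed
    # completion: one "token tag" pair per line, blank line between sentences.
    for document in DATA:
        for tokens, tags in document:
            for token, tag in zip(tokens, tags):
                f.write('{} {}\n'.format(token, tag))
            f.write('\n')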
# coding: utf8
from __future__ import unicode_literals
import gzip
import plac
import math
from tqdm import tqdm
import spacy