Web API & Dictionary & Dictionaries API

Web API
Flight APIs
Dictionary & Dictionaries API
Dictionaries
Wiktionary
Wiktionary Text Parser
Dictionary (Persian)
Dictionaries (Russian)
Google Translate API
Google APIs

Web API

Test Web API using Fiddler or Postman

Flight APIs

9 Flight APIs You Can Create Apps on to Get Live Flight Data | itnext.in

Tickets - Core API - Zendesk Developer Portal

Dictionary & Dictionaries API

ABBYY Lingvo Live
Babylon
Babylon Dictionary
Baidu
DeepL
Google Search
Google Translate
ImTranslator
Microsoft Translator
Multitran
Naver
Oxford Learner Dictionary
Promt
Reverso
Services
Urban Dictionary
Wikipedia
WordReference
Yandex
youdao

Persian

Pearson Longman: Interactive Phonemic Chart

special - Definition from Longman English Dictionary Online

Dictionaries

Cambridge Free English Dictionary and Thesaurus

Online dictionaries by bab.la - loving languages

Wiktionary

Mirrors of XML dumps, images and other data

Index of /mirror/wikimedia.org/dumps

Index of /enwiktionary/latest/

windows - Download all words from Wiktionary for offline use - Software Recommendations Stack Exchange

Index of /ruwiktionary/

Wikimedia Downloads

Google Code Archive - Long-term storage for Google Code Project Hosting.

Wiktionary:Download – Wiktionary

Tech Yandex

Slovari Yandex by

The Wiktionary API can be used to query whether or not a word exists.

Examples for existing and non-existing pages:

http://en.wiktionary.org/w/api.php?action=query&titles=test
http://en.wiktionary.org/w/api.php?action=query&titles=testx

The first link provides examples on other types of formats that might be easier to parse.
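
A minimal sketch of the existence check in Python. It assumes the JSON output format (format=json) and the default response shape, in which a non-existing title is listed under a negative page id with a "missing" key. The sample responses are hand-written for illustration, not captured from the live API, and the function names are hypothetical.

```python
import json
from urllib.parse import urlencode

API_ENDPOINT = "http://en.wiktionary.org/w/api.php"  # same endpoint as the links above


def build_query_url(title):
    """Build an action=query URL that asks for JSON output."""
    params = {"action": "query", "titles": title, "format": "json"}
    return f"{API_ENDPOINT}?{urlencode(params)}"


def page_exists(response_text):
    """Return True if at least one returned page is not flagged as missing."""
    pages = json.loads(response_text).get("query", {}).get("pages", {})
    return any("missing" not in page for page in pages.values())


# Hand-written samples in the shape the API is assumed to return.
FOUND = '{"query": {"pages": {"1234": {"pageid": 1234, "ns": 0, "title": "test"}}}}'
MISSING = '{"query": {"pages": {"-1": {"ns": 0, "title": "testx", "missing": ""}}}}'

print(build_query_url("test"))
print(page_exists(FOUND), page_exists(MISSING))
```

Fetching the URL (for example with urllib.request) is left out so that the sketch runs without network access.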

To retrieve the word's data in a small XHTML format (should more than existence be required), request the printable version of the page:

http://en.wiktionary.org/w/index.php?title=test&printable=yes
http://en.wiktionary.org/w/index.php?title=testx&printable=yes

https://www.mediawiki.org/wiki/API:Tutorial

wiktionarylookup

API matters (Russian)

mediawiki-api-announce mailing

Wiktionary Text Parser

*** Update 1. Wikparser will now extract synonyms from entries. ***
*** Update 2. Wikparser will now extract hypernyms from entries. ***
*** Update 3. Wikparser is now available on GitHub. ***
*** Update 4. Wikparser rewritten. New languages and functionality added. ***
Not many web-based dictionaries lend themselves to secondary research projects. English has WordNet, but if you’re interested in working with other languages, your options are limited.
Because I work primarily with the French lexicon (mostly on compounds and collocations), I needed access to a lexicographic repository that would allow me to extract and manipulate specific information, such as lexical categories, definitions, synonyms, etc. Although Wiktionary (a Wikimedia project) is not a perfect lexicographic resource, it is available in many different languages and, more importantly, it offers an API that allows for direct connections to its database. The results, however, cannot be automatically parsed. For instance, you can’t tell the API that all you want are definitions. It is only able to return the raw data for the entire entry. That’s why I wrote Wikparser.
What is it?
This small tool was developed to extract specific information from a Wiktionary entry. It is for research purposes only. This means that the output consists of nothing but text and is meant for storage or processing purposes only. It’s not meant to be pretty.
What does it do?
The current version is able to extract lexical categories (POS for part of speech), definitions, synonyms, hypernyms, and gender.
What languages are supported?
Currently, English and French are fully supported. Spanish and German parsing is also supported, but functionality is slightly more limited. You can add support for other languages by following the instructions at the bottom of this page.
How do I use it?
You have two options:
Option 1: Use the parser hosted on this site.
Option 2: Download the software (PHP) and run it on your own web server.
Option 2 is recommended, but let me start by explaining how it works. Please keep in mind that I’m calling this version a 0.2 release.
1. Instructions
The script is called by pointing your browser or other software (e.g. Google Refine) to URL/wikparser.php. The following parameters must be submitted as GET values (* indicates required parameters):
*word: any string
*query: pos for parts of speech; def for definitions; syn for synonyms; hyper for hypernyms
lang: Use whatever language code Wiktionary uses. Script currently supports English (en), French (fr), Spanish (es), and German (de) natively [default: en]. To add support for other languages, see below.
count: number of items to return [default: 100]
source: local if you are running a MySQL copy of the Wiktionary locally; api if you wish to use Wiktionary’s API [default: api]
Here are a few examples using the script hosted on this server:
http://www.igrec.ca/project-files/wikparser/wikparser.php?word=table&query=def&count=2
http://www.igrec.ca/project-files/wikparser/wikparser.php?word=puissance&query=pos&lang=fr
Both of the queries above return results by using Wiktionary’s API by default.
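
As a sketch, the same query URLs can be assembled in Python from the parameters listed above. The function name and the validation are hypothetical additions; only the base URL, the parameter names, and the accepted query values come from the instructions.

```python
from urllib.parse import urlencode

BASE_URL = "http://www.igrec.ca/project-files/wikparser/wikparser.php"
VALID_QUERIES = {"pos", "def", "syn", "hyper"}  # values listed for the query parameter


def wikparser_url(word, query, lang=None, count=None, source=None, base=BASE_URL):
    """Build a Wikparser GET URL; optional parameters are omitted when None."""
    if not word:
        raise ValueError("word is required")
    if query not in VALID_QUERIES:
        raise ValueError(f"query must be one of {sorted(VALID_QUERIES)}")
    params = {"word": word, "query": query}
    for key, value in (("lang", lang), ("count", count), ("source", source)):
        if value is not None:
            params[key] = value
    return f"{base}?{urlencode(params)}"


print(wikparser_url("table", "def", count=2))
print(wikparser_url("puissance", "pos", lang="fr"))
```

The two printed URLs reproduce the two example queries given above.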
If you opt for option 1 (i.e. using the software hosted here, as in the above examples), keep the following in mind: it’s slow. Because calls to the Wiktionary API require a bot login account to lower the delay between connections, requests will eventually time out if you run the script too quickly. According to Wiktionary’s documentation, an appropriate delay between calls is roughly 5 seconds (lower if you’ve registered a bot account). In order to not have my IP banned, I have set the delay to exactly 5 seconds. If you attempt to reconnect any faster than that, you will receive an error message.
Clearly, this is not ideal if you’re looking to gather a lot of data. For example, for every 1000 words requested, the script will take nearly 90 minutes to complete. Of course, if you want to use your own machine to connect to the Wiktionary API, you might get away with a lower delay, thus increasing the speed of your data collection.
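
A client that calls the hosted script in a loop needs to respect that delay. Below is a minimal sketch of a throttle in Python; the class is hypothetical and not part of Wikparser. The clock and sleep functions are injectable so the behaviour can be checked without actually waiting.

```python
import time


class Throttle:
    """Ensure at least `min_interval` seconds pass between consecutive calls."""

    def __init__(self, min_interval=5.0, clock=time.monotonic, sleep=time.sleep):
        self.min_interval = min_interval
        self._clock = clock
        self._sleep = sleep
        self._last = None  # time of the previous call, None before the first one

    def wait(self):
        """Sleep just long enough to honour the minimum interval."""
        now = self._clock()
        if self._last is not None:
            remaining = self.min_interval - (now - self._last)
            if remaining > 0:
                self._sleep(remaining)
                now = self._clock()
        self._last = now


# Usage sketch (fetch_word is a hypothetical function that performs the request):
# throttle = Throttle(5.0)
# for word in words:
#     throttle.wait()
#     fetch_word(word)
```

The first call returns immediately; later calls sleep only for the part of the interval that has not already elapsed.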
2. Using the parser locally
Requirements:
Apache or some other web server platform
PHP 5 (script will not work with PHP 4)
cURL (uncomment “extension=php_curl.dll” in your php.ini file if it isn’t already enabled)
Optional: if you wish to increase the speed of your queries, you’ll want to have a Wiktionary dump running on MySQL locally. Some changes must be made to certain files; instructions are given below.
Steps:
Download Wikparser 0.3 from GitHub (Updated 22 Oct. 2014).
Extract the files into a directory on your web server.
Point your browser to the file and set parameters via the URL.
If you’re only interested in extracting data for English or French words (and to some extent Spanish and German), that’s all there is to it. Feel free to set the delay between requests to whatever you’d like, but know that the Wiktionary API will eventually return errors if the delay is too short.
3. Downloading and installing a local copy of Wiktionary
This requires a bit of work. I might make a post in the future detailing the process (Done), but in the meantime, you can follow this tutorial by Dave Shaw. If you’re not interested in an English copy, you can download the dump for your language via MediaWiki’s backup index. Once you’ve created the tables, I suggest you set the collation to ‘utf8_bin.’
Important: When using the mwdumper.jar tool to import the XML file into your MySQL database, you should add “--default-character-set=utf8” at the end of the command if you’re using any language other than English. Click here for more info on using the mwdumper tool. Please note that importing the Wiktionary XML into a MySQL database takes a few hours. You can speed up the import dramatically by removing all indexes from the tables before importing. Follow the tutorial linked above, but after creating the database and before running mwdumper, go into the database and remove all indexes from the page, revision, and text tables using the appropriate SQL queries. Once the import is done, re-create the indexes by running the corresponding SQL index queries for each table, copied from the original script.
If you’ve succeeded in installing a local copy of the Wiktionary database, you now need to edit the conc.php file in the classes folder according to the database information you set up (the four variables that need to be modified are labelled in the file). You can then use wikparser.php as you would otherwise, except that you must now set the source parameter to ‘local’ (e.g. …/wikparser.php?word=big&query=pos&source=local).
If installing Wiktionary locally doesn’t seem worth the trouble, well, trust me, it is. You can easily query 10 words per second, if not more. This means that extracting data for 10,000 words takes about 15 minutes. If you respect the 5 second wait between requests to the Wiktionary API, however, this same task would take nearly 14 hours.
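
The timing figures quoted in this post can be checked with a few lines of arithmetic. This sketch assumes exactly 5 seconds per API request and exactly 10 words per second locally; real throughput will vary, and the function names are made up for the example.

```python
def seconds_with_delay(word_count, delay_seconds):
    """Total time when every request waits `delay_seconds`."""
    return word_count * delay_seconds


def seconds_at_rate(word_count, words_per_second):
    """Total time when `words_per_second` lookups complete each second."""
    return word_count / words_per_second


def as_hours_minutes(seconds):
    """Format a duration as whole hours and minutes (seconds are dropped)."""
    hours, remainder = divmod(round(seconds), 3600)
    return f"{hours} h {remainder // 60} min"


print(as_hours_minutes(seconds_with_delay(1_000, 5)))    # 1 h 23 min
print(as_hours_minutes(seconds_with_delay(10_000, 5)))   # 13 h 53 min
print(as_hours_minutes(seconds_at_rate(10_000, 10)))     # 0 h 16 min
```

These results are in line with the rough figures in the text: nearly 90 minutes per 1000 words and nearly 14 hours per 10,000 words through the API, versus roughly a quarter of an hour locally.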
4. Adding support for other languages
In order to add support for other languages, you must first determine the language code used by Wiktionary. It’s usually the standard two letter code, but you can always check by going to wiktionary.org and selecting the language you’re interested in. Then look at the first few letters of the URL:
http://tr.wiktionary.org/ : tr for Turkish
http://vi.wiktionary.org/ : vi for Vietnamese
Now open the language.config.php file in the root of the Wiktionary Parser. You’ll see a PHP switch. You must add a new case (or modify one of the ones included if you don’t care about keeping English or French functionality) for the language you want to work with. You’ll see the following:
case "INSERT LANGUAGE CODE HERE":
    $langParameters = array(
        "langCode" => "",
        "langHeader" => "",
        "langSeparator" => "",
        "defHeader" => "",
        "defTag" => "",
        "synHeader" => "",
        "hyperHeader" => "",
        "genderPattern" => "",
        "posMatchType" => "",
        "posPattern" => "",
        "posArray" => "",
        "posExtraString" => "",
    );
For instance, if you’re working with Turkish, you would insert tr between the case quotes. As for the rest, you’ll need to look at the output generated by the Wiktionary API (the output is identical for a local copy of the database). Call the API with a word and inspect the output to identify each of the parameters above. Here’s an example of the output for the word abuelo via the Wiktionary API using the Spanish language code (es):
http://es.wiktionary.org/w/api.php?action=parse&prop=wikitext&page=abuelo&format=xmlfm
You’ll need to scan multiple words to determine what patterns to use for whatever language you’re interested in. It can be tricky, as the raw data is messy and inconsistent. You’ll often find identifiers that differ from one entry to the next. Once you’ve figured out how Wiktionary encodes its data for that language, you can begin to fill in the parameters. Not all parameters need to be set for the parser to work; if you’re only interested in extracting synonyms, then only synHeader requires a value. One by one:
langCode: The string that identifies the language within Wiktionary (e.g. en, de, tr, etc.)
langHeader: The string that identifies the section for whatever language you’re working with. Wiktionary will often list multiple languages on a page for a given word (table, for instance, is valid in both English and French). It’s important to identify the string that starts a language section so that info from another language isn’t parsed. Ex. ==English==.
langSeparator: The string that separates each language on a given page. Sometimes it’s a simple string (e.g. “----” in the English Wiktionary), but in other cases you might have to use a partial string. For example, the French Wiktionary wraps languages within “== {{=fr=}} ==”, so we can assume that each new language section will begin with “== {{=”. This is therefore the langSeparator for French entries.
defHeader: The string that begins the definitions section. Not always present (e.g. English). In German, all definitions fall under the {{Bedeutungen}} string.
defTag: Definitions are usually preceded by some non-alphanumeric character (e.g. in English by “# ” (notice the space)). This differs between languages, however.
synHeader: String that identifies the synonyms section (e.g. English: ====Synonyms====).
hyperHeader: String that identifies the hypernyms section (e.g. English: ====Hypernyms====).
genderPattern: A regular expression that captures a word’s gender. Patterns used are often inconsistent, so you’ll need to go through a few pages to make sure you’ve identified all possible strings.
posMatchType: Either “array” or “preg”. This is how the parts of speech will be identified. If, like for English, there is a limited number of possibilities, you can simply store them in an array and set this variable to “array”. If the parts of speech vary greatly (like they do for French), then you’ll want to use a regular expression and set this variable to “preg”.
posPattern: If the parts of speech vary greatly, you’ll need to write a regular expression in order to identify them. If you’re unfamiliar with regular expressions, have a look at this quick guide, which also has a link to a tutorial.
posArray: If the parts of speech do not vary and are limited in number, you can store them all in this array and set the posMatchType variable to “array.”
posExtraString: When using regular expressions to match POS, you often need to add unrelated strings in order to capture the correct entry (e.g. in German, POS is preceded by {{Wortart|). Add this string here to have the parser strip it from the output.
Once these parameters are set, you should be able to call the script with the new language code passed as the lang parameter.
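
To make the role of these parameters more concrete, here is a small Python sketch of how langHeader, langSeparator, defTag, and the POS settings could drive extraction. It is an illustrative re-implementation under stated assumptions, not Wikparser’s actual PHP code: the parameter values, the sample wikitext, and the assumption that English POS headings look like ===Noun=== are all chosen for the example.

```python
import re

# Hypothetical values for English, modelled on the examples in the list above.
EN_PARAMS = {
    "langHeader": "==English==",
    "langSeparator": "----",
    "defTag": "# ",
    "posMatchType": "array",
    "posArray": ["Noun", "Verb", "Adjective"],
    "posPattern": "",
}


def language_section(wikitext, params):
    """Return the text between langHeader and the next langSeparator."""
    start = wikitext.find(params["langHeader"])
    if start == -1:
        return ""
    start += len(params["langHeader"])
    end = wikitext.find(params["langSeparator"], start)
    return wikitext[start:] if end == -1 else wikitext[start:end]


def definitions(section, params, count=100):
    """Collect lines that start with defTag, up to `count` items."""
    tag = params["defTag"]
    found = [line[len(tag):].strip() for line in section.splitlines()
             if line.startswith(tag)]
    return found[:count]


def parts_of_speech(section, params):
    """Match POS from a fixed list ("array") or with a regular expression ("preg")."""
    if params["posMatchType"] == "array":
        # Assumption: each POS appears as a level-3 heading such as ===Noun===.
        return [pos for pos in params["posArray"] if f"==={pos}===" in section]
    return re.findall(params["posPattern"], section)


SAMPLE = """==English==
===Noun===
# An item of furniture.
# A grid of data.
#: An example line that is not a definition.
----
==French==
===Noun===
# Meuble.
"""

section = language_section(SAMPLE, EN_PARAMS)
print(parts_of_speech(section, EN_PARAMS))
print(definitions(section, EN_PARAMS, count=2))
```

Because the section is cut at the separator, the French definition is never seen when English is requested, which is the reason the list above insists on identifying langHeader and langSeparator correctly.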
igrec.ca - wiktionary-text-parser

Our Top 6 Best Dictionary APIs for 2019

WordsAPI: Best for All Around Information
SYSTRAN Translation API: Best for Multilingual Information
Urban Dictionary API: Best for Slang & Colloquial Definitions
Twinword Word Dictionary: Best for Definitions, Associations & Examples
Tagdef: Best for Hashtags
Linguatools English Collocations: Best for Collocations

Google APIs

Sentence translation:

https://translate.googleapis.com/translate_a/single?client=gtx&ie=UTF-8&oe=UTF-8&dt=bd&dt=ex&dt=ld&dt=md&dt=rw&dt=rm&dt=ss&dt=t&dt=at&dt=qc&sl=auto&tl=fa&hl=en&q=Попули́зм
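
The URL above can be assembled programmatically. The sketch below only rebuilds the query string from the parameters visible in that URL; it does not document the endpoint, and the meaning of the individual dt flags and the response format are not covered here. The function name and argument names are hypothetical.

```python
from urllib.parse import parse_qs, urlencode, urlsplit

ENDPOINT = "https://translate.googleapis.com/translate_a/single"
# dt flags copied verbatim, in order, from the URL above.
DT_FLAGS = ["bd", "ex", "ld", "md", "rw", "rm", "ss", "t", "at", "qc"]


def translate_url(text, target, source="auto", ui_lang="en"):
    """Rebuild the translation URL with the text percent-encoded as UTF-8."""
    params = [("client", "gtx"), ("ie", "UTF-8"), ("oe", "UTF-8")]
    params += [("dt", flag) for flag in DT_FLAGS]  # repeated key, one per flag
    params += [("sl", source), ("tl", target), ("hl", ui_lang), ("q", text)]
    return f"{ENDPOINT}?{urlencode(params)}"


url = translate_url("Попули́зм", "fa")
print(url)
# Round-trip check: decoding the query string gives back the original text.
print(parse_qs(urlsplit(url).query)["q"])
```

Passing the parameters as a list of pairs rather than a dict keeps the repeated dt key, which a dict would collapse into a single entry.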

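Word translation (dictionary):

        // LANGUAGE, info.word and createCallback() are defined elsewhere in the original script.
        var url = `https://www.google.com/search?hl=${encodeURIComponent(LANGUAGE)}&q=define+${encodeURIComponent(info.word)}`;
        console.log(url); // debug output
        var xmlHTTP = new XMLHttpRequest();
        xmlHTTP.open("GET", url, true); // true for asynchronous request
        xmlHTTP.responseType = 'document'; // parse the response as an HTML document
        xmlHTTP.onload = createCallback(); // createCallback() must return the load handler
        xmlHTTP.send();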
