Examples¶

This section includes task-oriented snippets using kotobase

from kotobase import Kotobase

# Every method opens its own `read-only session`,
# so a single shared instance is enough
kb = Kotobase()

Results

All returned objects are typed, immutable DTOs that do not depend on an open session, so they are safe to keep, pass around and serialize

Comprehensive Lookup¶

lookup Aggregates Every Source Into One Result

result = kb.lookup("日本語")
result = kb("日本語")  # (1)!

for entry in result.entries:
    print(
        f"Written Form : {entry.headword}",
        f"High-Frequency Word ? : {entry.is_common}"
        )
    for sense in entry.senses:
        print(f"Meanings : {''.join(g.text for g in sense.glosses)}")

for kanji in result.kanji:
    print(
        f"Literal: {kanji.literal}",
        f"Meanings: {kanji.meanings}"
        )

Alias For kb.lookup

Options¶

result = kb.lookup(
    "食べ*",
    wildcard=True, # (1)!
    include_names=True, # (2)!
    sentence_limit=10,  # (3)!
    with_labels=True,  # (4)!
)
print(result.labels["sl"])

Treat * As A Wildcard
Include Proper Names From JMNedict
Return 10 Example Sentences + Translations
Resolve JMDict / JMNedict Tag Codes To Their Descriptions (sl -> slang)

Search Kanji¶

Filter Kanji By Scalar Attributes, Or Look Them Up By SKIP Code

n5 = kb.search_kanji(jlpt=5, limit=50)  # (1)!
eight_strokes = kb.search_kanji(stroke_count=8, grade=2) # (2)!
by_skip = kb.kanji_by_skip("1-4-3") # (3)!

First 50 Kanji Listed In Tanos' N5 JLPT List
Only Kanji That Have 8 Strokes And Are Learned In The Second Grade
Kanji That Are Vertically Split Into Left / Right Parts (1-), Where The Left Part Has 4 Strokes (1-4), And The Right Part Has 3 Strokes (1-4-3) (e.g 那). Read More About The System of Kanji Indexing by Patterns

Radicals¶

Find Kanji Which Contain Certain Radicals

radicals = kb.radicals()  # (1)!
matches = kb.by_radicals(["言", "五"]) #  (2)!

View Every Search Radical
Find All Kanji That Contain Both 言 + 五 Radicals (e.g 語)

Proper Names¶

tanaka = kb.names("田中")  # (1)!
places = kb.names(name_type="place")  #(2)!

Search A Proper Name Entry
Search By Name Type

Search By Meaning¶

Find Entries From Their English Gloss Using Full Text Search

for entry in kb.search_meaning("to eat", limit=10):
    print(entry.headword)

Example Sentences¶

Find Example Sentences Containing A Given Text

for sentence in kb.sentences("日本", limit=5):
    print(sentence.text)
    for translation in sentence.translations:
        print("  ", translation)

Furigana¶

View Furigana Segmentation For A Given Word

for item in kb.furigana("食べる"):
    print(item.reading, item.segments)

JLPT¶

Browse Tanos JLPT Study Lists

level = kb.jlpt_level("勉強")

vocab = kb.jlpt_list("vocab", 5)
kanji = kb.jlpt_list("kanji", 5)
grammar = kb.jlpt_list("grammar", 2)

Stroke Order¶

Access Stroke Order SVGs

svg = kb.stroke_svg("春") # (1)!
raw = kb.stroke_svg("春", raw=True) # (2)!
if svg is not None:
    open("haru.svg", "w", encoding="utf-8").write(svg)

A Renderable SVG Document
The Raw KanjiVG fragment

Audio¶

Access Pronunciation Audio From The Optional Audio Pack

clips = kb.audio("語")  # (1)!
for clip in clips:
    print(clip.reading, clip.fmt, clip.source, clip.license)

files = kb.audio_bytes("語")  # (2)!
name, data = files[0]

paths = kb.save_audio("語", "clips")  # (3)!

Clip Metadata, Without The Bytes
Each Clip As A (file_name, bytes) Pair
Write Every Clip Into clips/ And Return The Written Paths

Needs The Audio Pack

These Raise AudioDatabaseNotFoundError When The Optional Audio Pack Is Not Installed (kotobase db pull --with-audio)

Expanding Tag Codes¶

Exapand JMDict / JMNedict Tag Codes To Their Full Descriptions

labels = kb.expand_tags(["sl", "n", "vs"])
# {"sl": "slang", "n": "noun (common) (futsuumeishi)", ...}

Serialization¶

Every Result Object Can Be Turned Into A Plain Dictionary Or JSON With Japanese Text Kept Verbatim

result = kb.lookup("語")
data = result.to_dict()
text = result.to_json()
for field, value in result:   # iteration yields (field, value) pairs
    ...

Database Metadata¶

info = kb.db_info()
print(info["build_date"], info["size_mb"])

Using Kotobase Concurrently¶

The Read Layer Is Thread-Safe, So An async Application Can Run A lookup Off The Event Loop Without Blocking It

import asyncio


async def main() -> None:
    result = await asyncio.to_thread(kb.lookup, "日本語")
    print(result.query)


asyncio.run(main())