on the Curl Web Content Markup Language

on the Curl Web Content Markup and Programming Language from www.curl.com and www.curlap.com
Showing posts with label Nihon-go. Show all posts
Showing posts with label Nihon-go. Show all posts

Saturday, April 28, 2012

Tuttle Kanji Flashcard app


At http://www.aule-browser.com/kanji/tuttle-henshall-kanji.html there is now a Curl applet for the Tuttle kanji cards ( I am still on the red flashcard set ! )

I used this card as a reminder of the Bayware "Power" Kanji software for Windows 95 later marketed by Transparent Language as Kanji Power2 or some such.  That software had some real strengths in its visual interactive approach to syntax and post-position particles for Who-Where-When-verb basics and modifier-before-modified.

The strength of the Tuttle cards is that each flashcard includes four 2-Kanji combinations with the Kanji of that card.  At my age, my quarrel is with the small font for these - also an issue with the O'Neill paperback.

Font-size is even an issue in Firefox with the Perapera plugin as using CTRL-+ for the browser window content causes the plugin's popups to explode in size.

The Curl* applet illustrated above includes the UNICODE value and the Henshall number.

Request a variant, and I can produce it in about an hour (skip-code, Nelson etc) so long as the information is in the Kanjidic2 XML file.

I will post a download link for the CSV file for the Tuttle flash card set for anyone using Excel or the like.  Now to go look on-line for the "green" set ...

* Curl requires the Curl browser plugin from curl.com or curlap.com (in Japanese.) Curl is in version 8.0 and is very secure and my preferred alternative to Java as both language and browser plugin.  I will add desktop versions today or soon.



Wednesday, April 25, 2012

Kanjidic2 as CSV in HTML and text


To aid a neutral party in assessing approaches to digital dictionaries for Japanese, I have posted an HTML file displaying 10,000+ of the first entries in Kanjidic2 at
  http://kanji.aule-browser.com/kanjidic2-m12.html
I have restricted the dump to the Kanji, the UCS code and a max of 12 of a possible 14 meanings.

There are less than 10,200 due to the fact that in the first 12,155 entries, many had no XML meaning content which was not assigned a language attribute.  Those few thousand may have English translations in markup previously used for foreign languages.

The file can be found as
  http://kanji.aule-browser.com/kanjidic2-m12.csv
with a three line header which you may have to alter for your purposes.

The Kanjidic2 XML file was parsed using the Curl XDM library from curl.com (Nihon-go http://www.curlap.com)

As it stands, the HTML file should be useful for building custom Anki flashcards (themselves stored as SQLite.)   I will be using variant CSV output to construct dictionary software with annotations and spaced-repetition options.  Curl has both CSV and SQLite libraries in addition to the XML libraries.