This is an online diary of my journey to the fascinating world of languages. I started it in June 2016 with my Farsi blog. Five months later I discovered Somali for myself, so I wrote the entry at the bottom and decided to open the blog for all language experiences and my thoughts about them. My first language is German, followed by English. At the moment, my French is overriding my Arabic, but usually Arabic is my third language, both standard and dialect(s). I have a general interest in many other languages and their histories. My hope is to inspire people and to trigger some déjà vu moments.
What about mixing up languages and getting confused like Babel? I do have this nagging doubt, too, and thus monitor my learning in this respect. Like, when I do Spanish and then I do Portuguese and Italian shortly afterwards. So far it is OK, but I do not mix Spanish and Portuguese on the same day, that would be too much. Also, in focus weeks I will not look at a similar language, but maybe at a very different one for an evening or an hour. Normally, when I am in a particular language I do not think of the others. I focus on the material, not on the sky, so to speak. Maybe the trouble will start later, I don't know. - It is different when I talk. When I collect my Farsi for a short sentence to some acquaintance in the street there might be Turkish words surfacing in my mind instead. But this is because I hardly talk my languages at this stage, knowing, however, that an activation of this passive knowledge will be much much easier than starting from scratch. Plus: Whenever I have the opportunity to speak - and be it a single word - I do. A couple of weeks ago I met a Sorat (Syrian/(Neo-)Aramaic) speaker and remembered "hamzimmin" and "itachrin", and he was really amazed and I laughed and it was fine. I wanted to look up the video I listened to twice to recall the meanings of these words :-) It's OK, I think the video is no longer online. For an hour I had been in that language, like taking a bath in a sea abroad.
Steps. It is about steps and then levels, for each language. Small steps, smallest steps. A six-hour day with Spanish is a step. I can feel how my Italian is much better than my Spanish. In Spanish I still chew on easiest words, and the pronunciation is more complex and deviating from the written version. Following a written text with translation is relatively easy, though, even in Portuguese. I do not think much about steps and levels when I am at it, I just enjoy the beautiful strangeness of my language of today. At one point, the need to see the conjugations of verbs and other grammar things will appear. At another point one will feel the need to go through grammar systematically, or one reads some grammar without the need and rediscovers many things one has just experienced in the texts, like preposition-pronoun-compounds or the like. This can be huge fun with unexpected déjà vus because it is like a subsequent structuring of the chaos in the head without much effort.
Once we understand that we are limited by the time of our terrestrial presence and that we will never know as much as we want, it becomes easier to relax and spend an hour or two with a language we never had any relation with. I think I did this about ten times now, things like Irish (a Celtic language), Armenian (an independent branch of Indo-European, saw it on "Easy Armenian"), Old Egyptian, Old English/German, Sorat, Pashto ... It is like trying out a new restaurant. My metaphors are to express the number one factor for succesful learning - as opposed to this idiotic school learning - which is joy. The formula question thus is: How do I combine language X with my feelings of joy and entertainment? When you found your answer, you've got everything it takes.
By the way: I just re-read this whole page and its predecessor, the Farsi blog, and realized that it is "only" three years and three months back that I rediscovered language learning. Before I had been confined to German, English and Arabic. The rest had died off, it wasn't much, anyway. And today! This is so different.
The gentleman on the left is responsible for my fluent English. Voilà. At the age of eleven I became first interested in Elvis' music, and ten years later I was pretty fluent in English. In retrospective I would say the main reason was love for his voice and his music. That's the love factor, immensely important in language learning. Love is the most powerful resource and bursts all measures if necessary. Also straight-forward is the repetition factor. You listen to a song until you know every aspect about it by heart, and then you still listen on. And it is not that you want to learn anything, you just want to listen to this - whoever the artist may be in your case. Factor three is more specific, it is about the glue between word and music. The two melt into something new, something much easier to remember than mere vocabulary, even poetry. Because it is so pleasant, rich and evocative together.
Elvis recorded about 710 songs of which I know about 500 really well. Each song contains from 40 to maybe 200 words, including difficult, poetic ones. The Beatles recorded 213 songs (+ 100 live releases etc. after break-up) and my whole musical world at 21 may have included 2500 English songs, mostly 50s and 60s music. And now comes the dictionary bit. Even in school I started to use it. When the English teacher asked: What is the preposition after the verb to cling? ... a program would start running, rattling in my brain box, flashing through the internal lyrics dictionary, cling ... cling ... "It's not clinging to the rocks and ivy planted on their columns now that bind me", ah, OK, Gentle On My Mind. Three seconds, reasonable result. (I made this example up, but it went something like this.)
So when I had I real go at French not so long ago, one of the first things to do was to find as many Jacques Brel songs as possible (172), translate/analyze them and listen to them often because I love his music and voice. The rest of my music-based French dictionary is fragmentary: some Yves Montand and Edith Piaf, didn't get round to delve into Léo Ferré's vast oevre yet, and I love Fernandel's "La Bouillabaisse" from 1963. Then there is something in the same category which is non-musical, namely sound effect audio literature like Cyprien's "Épopée temporelle" and even "L'Histoire raconté par les chaussettes". And here, at the latest, Teddy will fully agree again because it is about dialogues and speech. So I admit there is an overlap between the music-based dictionary and the audio dialogue (maybe even monologue) based mental dictionary. I still hold that music has a deeper extra layer.
While thinking about all that in the above-mentioned amusement, I found this aspect of vobabulary knowledge. How well do we know words? How well do you know the word "litter"? Did you know that it can refer to a bunch of puppies? My reason for the question is the effect of the mental dictionary on how we learn words and how well we know them. When, for example, I know an Elvis song, then I usually know the lyrics really well. Not like in these cases when you do know the word, but you still look it up, just to make sure. Really knowing is to mean: Even when your best friend insists that you cannot "throw" a party, that this must be the wrong word, you have no doubt about your Jailhouse Rock that starts with "Warden threw a party in the county jail". No further discussion needed. Except maybe for colloquial developments in the past 60 years.
Stages of Vobabulary Knowledge(in progress)
So let's try to grasp this phenomenon by inventing a scale of vocabulary knowledge from zero to nine:
0. This is the easiest one: You never heard the word and cannot relate to it in any way. You couldn't say which language it is taken from, if it is a noun, a verb or what, and you understand nothing about it.
1. You had some sort of contact with the word and can say something about it. You cannot confidently give a meaning, except for easiest cases like "yes" or "and" or "we".
2. Now you know that "Stein" (pronounce: shtain) means "stone". You have no idea what a "Steinmetz" is, a "Bordsteinschwalbe" or a "Nierenstein" and you cannot deduce an adjective or a verb or a diminutive. On this level you have a fair understanding of the word if it is a spatial preposition or adverb like "in, from, up, down", and if it is a most basic verb or adjective. It corresponds with an A1 language knowledge.
3. You can actively use the word. If it is a simple word you can claim to know it on this level. If it is a complex word you will know more than one of its meanings. When you conjugate a verb or declinate a noun you will still make mistakes and thinking pauses. If it is a verb: You still don't know all the tenses. (A2 language knowledge).
4. On this level, the word is part of your repertoir. When you look it up you recognize it. You have heard and understood the word in different contexts. You have used it several times, maybe only in one specific sense. You might still forget the word in active usage and have to be reminded of it. If it is a simple word you will have a knowledge corresponding to level 7 of complex words.
5. You are firm in the general usage of the word and can give a definition. In complex words like "put" with its dozens of meanings you can estimate the scope of expressions that you still might have problems with. (B1)
6. You know the word, usually including spelling. In more complex cases you might look it up just to make sure. You have seen it in many contexts and shapes. You might not know every usage of a complex word, but you can confidently use it in speech and script with rare cases of mistakes. When you hear the word you do not have to think about its meaning. When someone uses it wrongly you will notice. You have an understanding about when and where to use the word, i.e. style and register. (B2)
7. This is level of an educated and advanced average understanding of the word in the mother-tongue. In case of verbs it contains the effortless ability of conjugation in all tenses and the correct usage of all prepositions in everyday speech and media language. In nouns, too, you know common rarer meanings and exceptional uses if there are some. (C1)
8. Here, you have a deep knowledge of the word, not only a daily usage understanding. If there are rare cases of prepositions with a special meaning, you will know it. You have seen the word in creative literary usage and you are (theoretically) able to use it nicely in a literary text yourself. If it is a rare word you will still probably know it. (C2)
9. You also know the history of the word, some earlier meanings maybe, dialect versions and some cognates in other languages. You are able to modify the word in a pun and to design a derivation, a new word.
(August 7, 2019) In the last four months I slid over to TED talks, Technology, Entertainment, Design. This popular nonprofit global thinktank with millions of views and millions of translations in subtitles has caught me. It is a development from the things I did before concerning language acquisition. Maybe (and I really don't know) my real interest is how we learn languages and not the individual languages of choice. What do toddlers do when they do what they do? Don't tell me it is a brain thing a grown-up cannot compensate. I have experienced myself how brain cells of an adult can be newly combined, newly connected. One key element is the subconscious. Learning is a kind of half-aware process. We don't learn words or grammar by focussing on them. Sometimes we do, but usually not. Rather, we listen attentively and then we reproduce. The main thing is not an effort. Let this enter: It is not an effort. Ask them. The kids. They won't even understand your question. In the past months I had experiences of listening to Spanish, Italian and other languages while reading the words, and I understood longer passages without translating, without thinking, just letting the words sink in, amazing. This is not what we learned in school. Far from it. Try this experiment: Write down the year and the number of pages of a book and try to remember the first number, explicitly deciding to forget the second number, e.g. 2006, 278. Just focus on the first number. Forget the second. Yes, it is there, but it is not important. We only need the year. Forget the 278! Sometimes when you see two three-digit numbers you might find that you remember them without the plan to do so.
So TED. Today I think that citybooks (see below) was a harbinger. The multilingual texts with audios are great, but maybe too literary and a bit too abstract for my level. Premature. TED talks bring you into live monologues of about a quarter of an hour (8-20 mins). The length is a plus as it is neither too short nor too long. The subtitles are a plus because most talks have subtitles at least in the language in which they are presented. I saw a Finnish one, though, with only an English translation, without Finnish subs; basically useless for me. Further on the plus side is the authenticity of the pieces. Then the interesting topics. There are very few talks that are boring. So you will want to understand what you hear. Also, there are immediate reactions from the audience. This is not to be underestimated, as you, the learner, want to see yourself within this happy crowd of understanders. Every time the audience laughs or applauds you learn something extra.
What else? You can copy the subtitles as a whole when you see the talks on YouTube. There is this function on the bottom right, when you click on the three dots (...). There you can open the transcript, switch off the time code, and copy all of it. What I do next is to paste it into a HTML document and open it in a browser to eliminate the paragraphs. Next I copy it into my Word document called "TED in Farsi" or whatever the language, underneath the previous texts. Sometimes I find translations to English or German, then I also copy them. If not and it is a European language, I use Google Translate plus one or two dictionaries (e.g. Langenscheidt + Leo), for some expressions Reverso is a good translator.
I analyze the texts in a way that I understand everything and write translations of unknown or unsufficiently known words and expressions on top of the expression in the way shown on the right. There is this "x2" function in Word next to the functions bold, italics, font size etc. Once translated, I listen and read at the same time, checking if I missed a word or if there is something difficult. In the sample here it was a particular pleasure to find this black humor and the dry way it was presented in the finale of the sentence: "Según parece, el consumo de droga en el ejército americano en Vietnam alcanzaba unos niveles tan astronómicos que formaba parte de la vida y de la cultura de allídort, igual que el rock and roll o disparar a la gente pobre".
Farsi can also be analyzed this way. There often are formatting problems with Farsi and Arabic texts because of the writing direction. But this here works well most of the times. In Farsi - and Arabic, but I don't collect Arabic talks - there is the additional problem of standard versus spoken language: Sometimes the subtitles deviate considerably from the original, be it in word choice, word order or grammar. This is a challenge, but in a way it also is an advantage, as you can learn two things at once - given you know the language well enough.
In this manner, I have by now collected and analyzed several talks in several languages: Dutch (3), Italian (3), Spanish (4), Portuguese (2), French (40), Farsi (2), Turkish (1), Greek (1). There is a freeware for converting YouTube videos into mp3s, so I can save the audios on my mobile phone and listen to the talks when I hike through the woods adjacent to my apartment. Sometimes I only listen, sometimes I read and listen. Sometimes also, I mark a word or remove the mark of another. Below you can see my personal TED book with all the texts I analyzed so far. I spent some time ornamenting it with pics like an enthusiastic child (which part of me still is, fortunately).
Can you find all languages in TED? No. Actually, it is not so obvious which languages are available because you cannot filter talks in specific languages. On
this page you can see all the languages, some of which have more than 3000 contributions. For my purposes, I am particularly happy with the four Romance languages fr, it, es, pt and with Dutch. I did not find anything in Danish, Swedish, Norwegian or Icelandic, but one in Finnish with both Finnish and English subs. For Farsi and Turkish I need translations (Google is not enough here), which restricts the choice. There are some Turkish talks with Arabic subs, that's OK for me. All in all, there are more than 1000 talks or translations in the following languages: Arabic, Bulgarian, Chinese, Croatian, Czech, Dutch, English, French, German, Greek, Hebrew, Hungarian, Indonesian, Italian, Japanese, Korean, Persian, Polish, Portuguese, Romanian, Russian, Serbian, Spanish, Swedish, Thai, Turkish, Ukrainian, Vietnamese.
Now, those texts are not in the first place meant for language learners, and most speakers will have no idea that their talks are used for that purpose. So it is not easy. When I translated/analyzed Spanish and Portuguese, for example, I seemed to understand a lot, but when I listened to it without the text there was almost nada left. The idea really is to listen very often, maybe ten times. Each time you will understand a little more, particularly when you return to the written text every now and then.
Of course, the TED-based language learner will watch out for speakers with clear articulation, without mumbling and muttering. Maybe even more important is to choose voices you like. Remember the child who loves mummy, including her voice. Her voice calms the child, gives it confidence. And so the child learns, it wants to come closer to the mother (and father etc.). This is significant! For example: Etienne Chouard's French piece "Chercher la cause des causes" is quite difficult because he is getting faster and faster, but I like him, his ideas, approach and his whole presence. So instead of discarding the piece, which at first I wanted to do, I on the contrary focussed on it successfully. (For the same reason, I watch the YouTube channel "Tev - Ici Japon")
Of course, you can also add translations for TED talks in your own language, and this is a good and even supportive way of learning, too. So far, I only produced subtitles for "Français avec Pierre" when I was at a loss with "tous, tout, toute, toutes, touts" and with "de, des, du, d', de la" - after translating the respective videos it became much easier. Right now I am writing subtitles for one of my own videos, a one-hour talk on creativity and art. German and English are finished, Arabic is following - which will take some time due to the cultural differences. Concepts like "sensitization" or "trance" can easily be brought to European languages, but Arabic has its own concepts, unrelated to the Latin tradition.
(Mar 20, 2019) In February I resumed my work on the Wörterkiste (word box) after a ten-month break. As the subtitle suggests, the book is a list of major words, made for self-learners of any language. As I am putting quite an effort into this project, I would like to explain it a little.
When you learn languages you will encounter the same topics over and over again in every language: words, grammar, texts. So I thought: Why not collect one single structured word list that can be used for every new (and previous) language? Of course, such lists already exist. Yet, as a cognitive linguist with a specific set of languages and as a learner, it seemed to make sense to come up with a list of my own, the Wörterkiste.
The idea of the book is that the user fills in the word translations by her- or himself, as part of the learning process. Maybe this is a silly idea, I don't know. But as I learn this way myself anyway, I can just as well offer this method and the material to others. So I brought it into manuscript form relatively early, with a foreword, a table of content, then the main part with about 130 pages in the form of the sample on the right, followed by a summary of all the words in the same order on 20 pages and finally a register for looking up words alphabetically.
Last year I spent about two months on collecting and arranging words, all in German, my mother tongue. I did browse through word lists in different languages, but mostly found the words by myself, keeping a notebook handy wherever I went. So the list was in constant change, which it still is. I realized that it was necessary to test the words by translating parts into different languages, preferably not all from the same language family. As could be expected, some words were no problem at all, for example animals and plants. But even when you only translate into one single language you will soon discover how many words are ambiguous in ways you would never have thought of. "Youth", for example, can be a bunch of youngsters, but it can also refer to the time opposite to old age. And there are languages that distinguish the two, like Malay. The first one would be "orang muda", the second "masa muda".
Apart from taxonomic lists like body parts, colors, countries, numbers etc., the Wörterkiste creates contexts. So underneath the entry for "mouth" you will find the verb "to suck" and under "tongue" there is "to lick". Words connected to pets and pet-keeping are collected together, as they form a semantic group, and they are easier to grasp in this context. Positive character traits like generosity, modesty, sincerety are subsumed in the subchapter "My Hero/Heroin" and negative traits like stinginess, impoliteness, laziness are grouped under "The Black Sheep". In most cases, though, I kept opposites together, as they help clarify meaning. Other scenarios are concepts we use in the context of telephones: call, busy line, hanging up, dial etc. So they are not just lists, but the words are provided in some kind of context and sometimes with an additional sub-context.
About a month ago I resumed the work, as mentioned, and this time I started using an excel document. This is much more convenient, as now I can search for and find any word in the Wörterkiste in no-time. The other day I had a Syrian guest who told me the Arabic verb for the sound doves make, "hadala", and I added it immediately to the excel spreadsheet. This word is a bit specific, but I like to know the verbs for animal sounds in different languages. And why not? With the new document I am also able to fit in new languages according to language families, and I can switch the columns the way I like, make excerpts etc.
It did not take long until I had added new languages, so that now the Wörterkiste is evolving in twenty-five languages. The speadsheet is so broad that I have to show you this sample in two parts, see above and below. So now I am getting to the nitty-gritty. When I work on the Wörterkiste I sometimes concentrate on one language and try to translate one chapter or half of it. At other times I take a cluster of five or ten words and go through the individual languages. Often I shift words and whole groups from one place to another, there are always new words to add and some existing entries to cross out, as they turned out to be not as clear-cut or as relevant as I had originally thought.
But how can I know the right translation when I have no idea of the language, maybe don't even read the alphabet? Well, there are tricks. First of all, there are the big multi-lingual online dictionaries like Langenscheidt, Pons and World Hippo, also Glosbe, Reverso, Bab.la, Dict and some others. The Lexilogos site connects you with major online dictionaries for each language, but they don't have all the good stuff, you must search for yourself, too. Among the dictionaries there are good ones and bad ones. The good ones explain word context by defining in detail the German/English word, some provide compound words that clarify, for example, whether the word "language" means "human speech" (as in language ability) or "set of vocabulary and grammar" (as in foreign language). In Somali, the first will be translated as "lughad", the second as "af". Bilingual dictionaries are also helpful, of course. I found Cambridge for Malay and Collins for Hindi, for example. And I don't need German- or English-based dictionaries, for French and Arabic are OK for me, too. Even Italian and Dutch work. And my knowledge of Persian often helps when it comes to Kurdish and Urdu/Hindi words.
Then there is Wikipedia. It has many words with context, and it is a very useful tool. Double- and triple-checking is another important method, using different sources for the same word. And when nothing helps, you can still type "How do you say X in the Language Y?" into your search engine or ask a native. In the process, there is so much to learn. One day I spend with some alphabets I am not yet familiar with, another day I focus on one language or on a specific word group. Some grammar here and there plus language videos from time to time. Right now, after a month of working like this, I feel equipped and ready to go. Well, to go ... on. It will take about six years to accomplish it, if I don't leave the task for longer spells. Looking forward to it!
(Jan 16, 2019) It was twelve days ago. I turned on Paul's Langfocus Channel (YouTube) during my lunch break, being in the middle of analyzing the 82-item YouTube playlist of French TED talks with subtitles. Paul has a nice way of introducing languages and he certainly shares a general love for them. I chose the video by mere accident; it was one of the few I had not watched before. So Toki Pona, OK. He explained that it is an invented language. "Boring", I thought, but let's see. Then Paul said that it is a special language because it consists of only 120 words. Suddenly I was all awake. What? What - was - this? While watching the video to the end I learned with sample sentences that "li" separates subject from predicate, that "e" introduces a direct object after a transitive verb and that you can order multifold compound nouns/adjectives via the separator "pi". But my mind was already detached from terrestrial spheres and going its own way. What - is - this?!
In order to communicate my epiphany adequately, there are some things to be explained. Toki Pona is a minimalist language, and I am a minimalist artist. That's number one. It has always been part of my art to reduce meaning, material etc. to a minimum, avoiding verbiage, waste and abstraction (Ezra Pound: "No verbiage!"; "Go in fear of abstraction!" - search for the mind-blowing "Imagistes", 105 years ago, London/Paris), getting to the core of things, no mincing of words. Particularly in my poetry and songwriting. Number two is even cooler because both Toki Pona and my way of life are heavily influenced by Taoism. The experimental Bamboo series of 250 one-pagers, for example, the backbone of my literary production, was immediately inspired by Lao Tse, as you can read right on top of the preface of the first book (in German).
With number three we are getting to the linguistic part. When I studied Anglistics in Hamburg in the early nineties, there was an interesting kind of movement going on in the linguistics department. It attracted me so much that I soon decided to enter the linguistic rather than the literary department of the branch. In Germany's humanities you usually have one major and two minor subjects, but two major subjects are also possible, and so I settled on Islamic Studies (Arabic + history) and Anglistics. I was intrigued by the innovative Dr. Roger Böhm and Prof. Radden on the basis of my knowledge of Wittgenstein from school days.
For example, I learned about case grammar. There were some blokes in recent history who said: OK, we have different sets of cases in different languages, like nominative, genetive, accusative, dative, instrumental, locative etc. Couldn't there be a deep level where all languages meet? So one guy came up with a list of over 100 cases and others thought: Wait a minute! Let's take the minimum number and combine them! So they said: abs (absolutive), loc (locative), abl (ablative), erg (ergative). And then they combined them, like in the sentence: The pirate (erg, abl) handed us (loc, erg, abs) the loot (abs). - I liked that, just as I liked George Lakoff's theories in cognitive science: metaphors, metonymies, prototypes etc. And Noam's deep grammar to begin with. It was all about how language works (in the mind).
Lakoff had just published his groundbreaking "Metaphor and War: The Metaphor System Used to Justify War in the Gulf" plus follow-ups via a new thing called the Internet, he was my total hero. And in the department, students and staff were collecting everyday metaphors by the thousands.
My primary interest in linguistics has always been understanding. Not, for example, coming up with a new theory or making money. The idea rather is: when you understand your own language you will better understand your own mind. We people have these incredibly complex and beautiful abilities to speak and create art.
We want to understand what this is all about. I want to know what it does to my mind when I have gathered experience in this language. Will it reduce my thoughts to an essence? Will it bring out the essence of my thoughts? How deep can Toki Pona go? Does it make sense to write poems in Toki Pona? Are there enough "atoms"? Will the expressed thoughts be hollow or vague? There are some poems from Loving Jay I want to translate at a later stage, and more flea cartoons. To see how it is when Anis language meets Toki Pona.
The Garden of Eden
So OK, we have (slightly more than) 120 words at our disposal. The first interesting question is: which concepts are NOT included? Taoist question. No day and night - wow - no danger, part, joy, help, anger, peace, friend, sister. This means: even such basic seeming concepts can be broken down, like molecules into atoms. Yeah, OK, they can, but does it work? Well, seventeen years of visibly growing success say: yes. What started as a private game by the Canadian linguist Sonja Lang was first publicized in 2001. Her official book "Toki Pona. The Language of Good" only appeared in 2014, and then the language became a ... what? Community, inspiration, whim without borders, computer language guinee pig, playground for freaks, addictive game? In any case, it obviously works and thrives.
Sonja Lang constructed a language similar to the case grammar mentioned above: she combined minimal semantic entities. If you now take into account - as number four - the work on the Wörterkiste (word box) I mentioned in the previous post you will understand why I spent three consecutive days producing a chart of the Toki Pona words, like a taxonomy of chemical elements. I will always remember these days as being a long trance vacation in the Garden of Eden. On the picture above you can see an excerpt of the three-page chart. I spent happy hours shifting the boxes from here to there, giving them colors according to meaning (social activity, body parts, measurements, physical objects, etc.) and adding meanings from different online lists. It was as if I knew exactly was I was doing and what I wanted as a result. It is still being updated, but basically finished.
Entering the inspiring game we can watch how minimal language works, indeed how language works, deprived of all fat, rid of skin and flesh, down to the bone. I translated and posted a flea cartoon on the toki pona facebook page. That was fun. "Fleas" are "pipi lili" (little insect). The translation is: "Mama, why are the fleas in the sky all white?"
I also asked myself how it will be like when you have invented a language privately, and then suddenly it takes off like that. I mean: search for Toki Pona on YouTube alone and you will know what I mean. I listened to the 40-minute interview by Kris Broholm (2014) with Sonja Lang (interview starts at min 5:00), and she explained that indeed the language has developed due to usage. Well, this is what languages do, they develop when they are used. It is hard to imagine what it is like for Sonja, though. Surely an adventure.
Once I came up with parts of a language myself, when I wrote the sci-fi fragment Omega 5. This was long before I even heard the term con(structed) language. But in that case it was not an attempt to construct a full language, I just needed some philosophical terms that were not so well-worn and that combined different concepts.
How to Approach Toki Pona?
For me, all languages are different to learn. Toki Pona is special because you cannot turn on a news channel or radio stream. You cannot listen to natives talking in their natural habitat because there is no such thing.
So after the collection and ordering of the Toki Pona words I went over to grammar and syntax, using sample sentences and giving them colors again: Noun phrases in orange, predicates in blue, prepositions and adverbial phrases and the rest in green. For the small words I used lighter colors. When you really dig into it you will find incoherences, but for working purposes it is fine.
We have little grammar, as there is no (grammatical) singular and plural in Toki Pona, only one case marker "e" for the direct object, no conjugation, no gender, no tenses, no mode except for the imperative. Therefore, it is best to learn with examples and to integrate idiomatic usages in the grammatical features, like the ways to ask questions or to talk about time. Grammar in Toki Pona is syntax-focused and much closer to idiomatic terms than elsewhere.
The picture above is an excerpt of a dense page with sample sentences to explain all grammar, syntax and typical usage of more complex words like "lon" or "tawa". So whenever I come across a new idiom or a structure I newly understand, there will be a place to store it.
I still felt unable to find my way around in this language until I had produced a list of Toki Pona compound words. There can never be a real dictionary because it is part of the tp philosophy and challenge to make things up according to context. But there are some very often used compounds you will have to know, anyway. My simple alphabetical list consists of 440 entries now, including some that are not obvious. When I come across a new one I will add it. Reading through the list also deepens the knowledge of some tp concepts and grammatical/semantic relations.
The final element are texts. There are some collections with and without translations online, and I filled some pages and read them again and again as I get closer to the end of my 14-day intense trip. It is true: you can really learn this mini-language to a considerable extent within a fortnight.
Once I browsed Toki Pona links I realized how much is going on. There are dozens, no, hundreds of contributions: blog entries like this one here, descriptions in many different languages, poetry and prose, articles about grammatical details, link lists and meta link lists (this here is the most comprehensive one I found). There is a lot of love in all this.
Now I have arrived, and it is a good feeling. While being on the brink of resuming other languages, every now and then I will see a facebook post in Toki Pona, like the riddle of the day. I will keep on trying out links and reading stuff. It is like a new companion. Also, with this new experience, my other languages are starting to ... change. They seem less complicated than usual. Amazing side-effect.
The official website is
tokipona.org and there is the extensive
Wikipedia has a lot of material. See
jan Pije and
John Clifford, too. The new YouTube channel
seme li sin? (= What's New?) is posting news bulletins in Toki Pona. With these links you will be able to find your way around and get hold of even more material.
There is a lot of motion and development in the field of Toki Pona. It surely is an established language by now and has gone through some minor changes. At the same time, there is a lot left to do for people who want to contribute in one way or another.
To conclude (for the time being) I would like to support "Fingtam Languages" (YouTube)'s view that it pays to learn Toki Pona, this constructed language that does not claim to serve as a world language like Esperanto. To wrap it up: Toki Pona is extremely simple to learn, it shows you how language works, it is fun, it brings you to essential talk and thought, and it is quite a phenomenon if ever I saw one.
(Jan 08, 2019) It's been a long time and a lot of things happened. I couldn't keep up with this blog as it would have distracted me from the things I actually wanted to do. So what has happened on the language front since April 2017? Well, I delved into chess and go for a while and found myself affirmed when I heard ex-chess world champion Anand in his Hawaii interview on YouTube, where he stated that chess is like a language. Later I started with Dutch and followed the cute series "Heb je zin?" (Are you up to?). I also began learning Italian on a mild scale, just to get used to the sounds and most frequent words. I went back to the Goethe 2 (Fifty Languages) site and made myself new audio copies: French with Turkish translation, Italian with Dutch, Spanish with Danish.
In 2018 I started thinking about a general vocabulary inventory that can be used for learning all kinds of languages. I developed "Die Wörterkiste" (word box), a book with now around 7000 German words, ordered according to context (e.g. animals, colors, science words). I also created association chains like "In the House", there you walk through the individual rooms and look at the inventory and some verbs. Or "Travel" with travel words ordered partly like a story. For months I had been shifting words and clusters from here to there, reconsidering, enlarging things, melting things together etc. In April I started translating parts of the Word Box into nine languages to check its validity and usefulness. I had time because in my bread job that year there was nothing much to do.
At first, I was going to publish the book as a tool for language learners like myself: You acquaint yourself with the book and then start to fill in the words in your target language as a game or an exercise, according to your personal approach. But then I was not content and felt the need to know more languages before finalizing the Word Box. I should have at least a forth fluent language at my disposal before resuming the box. So I chose French because I had one year in school, then an intense phase in 1993 and at least two more intense phases, all in all with little success.
Thus from May until the end of the year I had an intense French course with at least five daily hours on average. I translated all 172 Brel songs into German and listened to them often. I collected texts with audios, inserted vocabulary where needed and read while listening. Le Petit Prince (5x), Monsieur Ibrahim (4x), Boule de Suif (5x) and many others. Cyprien, L'Histoire racontée par des chaussettes, TED en Français ... I listened to Stendhal, Le Rouge et le Noir, and to Jule Verne's long novel "Les enfants de Capitaine Grant". I watched documentaries, with and without subtitles, arte, c'est pas sorcier, RT France, comedy, grammar videos and what not. I did not talk, though, and hardly wrote anything. But that's OK, as long as I feel I am learning a lot. I don't need French in my daily life. I just want to understand EVERYTHING about it, that's all.
Toward the very end of the year I slowed down the gallop. This was when I re-discovered citybooks.eu and understood its true value. I took the half-hour text by Cees Nooteboom about Venice and ordered the languages next to each other: English, Dutch, French and Italian. There are high quality audios in all four languages. Here is the top bit of 25 pages:
So this is where I stood on January 4, 2019, when I came across Toki Pona ...
PS: On Jan 18, citybooks published my fan article about them on their website in English and Dutch. Very enjoyable.
(April 15, 2017) In the video I linked here, polyglot Alexander Arguelles opens a horizon for people who love learning languages. One of his example arguments is that in the world of sports the world record for running 100 meters had for a long time been stable. But as soon as the record was broken, it was broken several times in a row. Arguelles compares this to language acquisition: Once it becomes normal to know several languages one realizes that the human limitations are not as narrow as one had thought they were. Arguelles, for example, is learning languages by clusters, not just one at a time.
For me, after a first in-depth study of Somali I got back to Farsi and am entering a new level because I use my Farsi almost every day when I talk to people from Afghanistan and Iran. There is a guy in the camp, where my office is, and we knew each other before I started learning Farsi last June. When we sit together now, the situation is much different. We can talk in full sentences :-)
After that weekend I came across Mitsurugi's blog 'Nabad iyo Caano' and found that he gave a very useful overview with grammar, vocabulary and texts for beginners, and with the spirit of a student who is able to think as a linguist. Most of the 229 entries date back to 2011/2012. I copied and pasted everything into one word document and at first reduced it from 150 pages to 50. Then I had some extra time due to a one-week vacation. I thoroughly worked on Mitsurugi's paper and printed out a new version every day for five consecutive days, until everything was perfectly formatted, all Somali words in red color, similar parts fused, reduced to the substantial, ordered. The focused occupation with this text gave me a fundament for learning the language. In the end - that was yesterday - I got the whole text on nine sheets of paper, duplex print with two pages on one, 36 pages total. Total working time was something around 40 hours. I kept the previous versions as a souvenir. Tidying up and making order is my life-theme, this is why I can learn so well while doing it. I love it.
Also yesterday I expanded my vocabulary page from half a page to three pages with a total of about 1000 words and English translation. A mini-dictionary in one paragraph: Somali word in red + hyphen + English word + semicolon; Somali word in red + hyphen + English word + semicolon etc. Then I changed the colors of the verbs to blue, adjectives to green, colors to light green and linguistic technical terms in brown.
There is a good grammar out in German, I ordered it, although it is 60 euros. Online I found the complete amazing 'Qaamuuska Af Soomaaliga' (Puglielli/Mansuur, Rome 2012, PDF), a modern Somali-Somali dictionary with 970 pages (!) and a fine grammar section in the end with charts and tables. I am working on that right now, until Jörg Berchem's grammar arrives. I still have some days vacation, today is only Thursday.
I found four online dictionaries: translate.google.com, freelang.net/online/somali.php, glosbe.com/en/so and lexilogos.com/english/somali_dictionary.htm, but I cannot yet say how good they are. I did not need them much yet. Good is learnsomali.com. Then for audios, there is a zip file on mylanguages.org with about 700 mp3s, each with one word or expression or sentence spoken in American and twice in Somali. Seems to be a manual for US soldiers, the choice of sentences leaves no doubt ('Stop, or I will shoot!', Follow our orders!', 'We are Americans, you are safe' etc., but also normal stuff.) There is no written text to go with it, but I don't miss it. I know many of the words from my recent hours of private editing. Yesterday night I listened through it, I think it was more than an hour. (PS on April 24, 2017: After some research I also found the written text online plus several more chapters of audios with PDF. The politics behind the material inhibits me, but I do use it.)
OK, this is where I stand right now. So why Somali to begin with? Well, the next larger refugee nationalities in my surroundings are Somali and Eritrean. Somali is spoken by about 17 million people, Tigrinya (the main language in Eritrea) has less than half that amount of speakers. It also has a complicated script of its own, while Somali today is written in Latin characters. That was basically enough for the choice. Many Somalis speak Arabic, and there are many loan words from Arabic, too. But not a quarter as many as in Farsi.
There are some peculiarities in Somali, like two 'we's, one is 'we without you', one is 'we including you'. There are so-called focus words and classifyers 'baa', 'waa' etc. which put the focus on a specific part of the sentence to highlight it. Grammar is full of small words like prepositions, suffixes and pronouns that fuse with each other and with other words, often with shifting consonants and vowels. For me, this language is far out. It is not similar to any language I learned before. It is not related to Arabic, but shares some significant phonemes like emphatic q, 'ain and emphatic h, all produced deep in the throat. The d sound is close to r sometimes like in India and Pakistan, some other times it sounds like a usual d sound, then again like a voiced th sound as in 'the'. The tone and melody of the words and sentences are much different from what I know. All in all, I am - you can see it - fascinated with this language in a new way. And I can relate to the music, this is important. I don't go much for Arabic music, you see, and this here is close to blues, reggae and other styles that I know from US music. The language is very musical, very oral. The throat sounds and occasional staccatos give it a rough, Klingon touch, but then there is this tender and almost sung overall discourse that embeds it.