<?xml version="1.0" encoding="utf-8"?><feed xmlns="http://www.w3.org/2005/Atom" ><generator uri="https://jekyllrb.com/" version="3.10.0">Jekyll</generator><link href="http://venkatasg.net/feed.xml" rel="self" type="application/atom+xml" /><link href="http://venkatasg.net/" rel="alternate" type="text/html" /><updated>2026-03-09T21:05:59+00:00</updated><id>http://venkatasg.net/feed.xml</id><title type="html">Manmade</title><subtitle>A collection of my thoughts and feelings</subtitle><author><name>Venkat</name></author><entry><title type="html">Dark and Stormy</title><link href="http://venkatasg.net/blog/ans-2026-01-31.html" rel="alternate" type="text/html" title="Dark and Stormy" /><published>2026-01-31T00:00:00+00:00</published><updated>2026-01-31T00:00:00+00:00</updated><id>http://venkatasg.net/blog/ans</id><content type="html" xml:base="http://venkatasg.net/blog/ans-2026-01-31.html"><![CDATA[<p>I’m especially proud of this: I wrote <a href="https://arxiv.org/abs/2510.24538">a paper</a> on sentences from the <a href="https://www.bulwer-lytton.com">Bulwer-Lytton Fiction Contest</a> (BLFC). I don’t recall how I found the BLFC, but I knew these sentences were too good to not study further. All credit to <a href="https://www.bulwer-lytton.com/about">Scott and EJ Rice</a> for organizing this wonderful contest for many, many years, and graciously giving me permission to study the resource they curated.</p>

<p>As it turns out, just having data isn’t enough to get a paper out. Neither is having fun ideas about how you’d analyze it. Working on it for months on end despite multiple dead-ends, and finding a wonderful collaborator in <a href="http://www.laurabiester.com">Laura</a> were key to finishing this paper. I hope people find the resource and paper interesting, and a jumping-off point for further work!</p>

<p>One of my favorite analyses in the paper is the final section, where we look at ‘deviant’ adjective-noun (AN) bigrams. Deviancy is hard to define, but it arises when a phrase is grammatically well-formed but the overall meaning is weird or incongruent (<em>colorless green</em>, <em>residential steak</em>, <em>warm equation</em>). The term is from <a href="https://onlinelibrary.wiley.com/doi/10.1111/cogs.12330">a 2016 paper by Eva M. Vecchi</a> where they collect paired acceptability judgements for about 25K ANs, building distributional models that predict deviancy extremely well.</p>

<p>We found that Bulwer-Lytton sentences contain many more deviant ANs than control sentences (from openings to novels and humor datasets). What was surprising was that <strong>LLM-generated Bulwer-Lytton sentences contain even more deviant ANs</strong> than BLFC entries. To derive the count of AN bigrams, I used the excellent <a href="https://infini-gram.io"><code class="language-plaintext highlighter-rouge">infini-gram</code></a> API to query the <a href="https://huggingface.co/datasets/mlfoundations/dclm-baseline-1.0">DCLM pre-training corpus</a> containing over 3 trillion words. Figure 5 from our paper shows this effect beautifully: the BLFC corpus contains many more rare/deviant ANs than control sentences. Further, 80% of ANs generated by DeepSeek occurred less than 1000 times in the DCLM corpus, compared to 40% of ANs in the BLFC corpus, and 25-30% in control sentences.</p>

<div class="img-container">
    <a href="/files/blog/ans/paper-an-plot.png">
        <img src="/files/blog/ans/paper-an-plot.png" alt="" />
    </a>
</div>
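
<p>Counting an AN this way is a single API call. Here’s a minimal sketch of such a query in Python (the endpoint and payload shape follow the public <code class="language-plaintext highlighter-rouge">infini-gram</code> API docs; the exact DCLM index string below is a placeholder, so check the docs for the current name):</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>import requests

# Count a bigram in a pre-training corpus via the infini-gram API.
# NOTE: the index name is a placeholder -- look up the exact DCLM
# index string in the infini-gram API documentation.
payload = {
    'index': 'v4_dclm-baseline_llama',
    'query_type': 'count',
    'query': 'residential steak',
}
result = requests.post('https://api.infini-gram.io/', json=payload).json()
print(result['count'])
</code></pre></div></div>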

<p>This got me thinking back to Eva M. Vecchi’s original dataset of deviant ANs. They quantified these ANs as unattested by looking at a corpus of 3 billion words — tiny by the standards of today’s pre-training corpora. What if we query their counts against the DCLM pre-training corpus? The figure below validates that most of these ANs are indeed deviant: 80% of them occur less than 1000 times in the corpus.</p>

<div class="img-container">
    <a href="/files/blog/ans/an-plot.png">
        <img src="/files/blog/ans/an-plot.png" alt="" />
    </a>
</div>

<p>My counting methodology isn’t perfect. <code class="language-plaintext highlighter-rouge">infini-gram</code> just queries raw strings, so some entries are over/under-counted for various reasons: misspellings, polysemy (meaning it’s not really an adjective-noun bigram), a missing prefix/suffix, etc. But I don’t think it takes away from the story these figures tell. Then, with all these deviant ANs at my disposal, I got another idea.</p>

<p>So <a href="/projects/deviant-ans.html">here it is</a> — a webpage that gives you a random deviant adjective noun bigram (I filtered the list down to 3994 ANs that occured less than 10 times in the DCLM corpus). Here are some of my favorites from randomly reloading the page: <em>bold airfield</em>, <em>classical bowler</em>, <em>informal fate</em>, <em>nuclear suitability</em>.</p>]]></content><author><name>Venkat</name></author><summary type="html"><![CDATA[I’m especially proud of this: I wrote a paper on sentences from the Bulwer-Lytton Fiction Contest(BLFC). I don’t recall how I found the BLFC, but I knew these sentences were too good to not study further. All credit to Scott and EJ Rice for organizing this wonderful contest for many, many years, and graciously giving me permission to study the resource they curated. As it turns out, just having data isn’t enough to get a paper out. Neither is having fun ideas about how you’d analyze it. Working on it for months on end despite multiple dead-ends, and finding a wonderful collaborator in Laura were key to finishing this paper. I hope people find the resource and paper interesting, and a jumping-off point for further work! One of my favorite analyses in the paper is the final section, where we look at ‘deviant’ adjective-noun (AN) bigrams. Deviancy is hard to define, but it arises when a phrase is grammatically well formed but the overall meaning is weird or incongruent (colorless green, residential steak, warm equation) The term is from a 2016 paper by Eva M. Vecchi where they collect paired acceptability judgements for about 25K ANs, building distributional models that predict deviancy extremely well. We found that Bulwer-Lytton sentences contain many more deviant ANs than control sentences (from openings to novels and humor datasets). What was surprising was that LLM generated Bulwer-Lytton sentences contain even more deviant ANs compared to BLFC entries. To derive the count of AN bigrams, I used the excellent infini-gram API to query the DCLM pre-training corpus containing over 3 trillion words. Figure 5 from our paper shows this effect beautifully: the BLFC corpus contains many more rare/deviant ANs than control sentences. Further, 80% of ANs generated by DeepSeek occurred less than 1000 times in the DCLM corpus, compared to 40% of ANs in the BLFC corpus, and 25-30% in control. This got me thinking back to Eva M. Vecchi’s original dataset of deviant ANs. They quantified these ANs as unattested by looking at a corpus of 3 billion words— tiny by the standards of today’s pre-training corpora. What if we query their count against the DCLM pre-training corpus? The figure below validates that most of these ANs are indeed, deviant. 80% of them occur less than 1000 times in the corpus. My counting methodology isn’t perfect. infini-gram just queries raw strings, so some entries are over/under-counted for various reasons: misspellings, polysemy (meaning its not really an adjective-noun bigram), a missing prefix/suffix, etc. But I don’t think it takes away from the story these figures tell. Then I got another idea with all the deviant ANs I had at my disposal. So here it is — a webpage that gives you a random deviant adjective noun bigram (I filtered the list down to 3994 ANs that occured less than 10 times in the DCLM corpus). 
Here are some of my favorites from randomly reloading the page: bold airfield, classical bowler, informal fate, nuclear suitability.]]></summary></entry><entry><title type="html">Punching Bag</title><link href="http://venkatasg.net/blog/punching-bag-2025-12-22.html" rel="alternate" type="text/html" title="Punching Bag" /><published>2025-12-22T00:00:00+00:00</published><updated>2025-12-22T00:00:00+00:00</updated><id>http://venkatasg.net/blog/punching-bag</id><content type="html" xml:base="http://venkatasg.net/blog/punching-bag-2025-12-22.html"><![CDATA[<p>This is going to be <a href="https://venkatasg.net/blog/pride-2025-01-25.html">another</a> <a href="https://venkatasg.net/blog/memory-2023-08-25.html">post</a> reminiscing on the past, and my present self possibly assigning too much meaning to an incident that’s wedged in my brain. My English teacher in 9th or 10th standard wrote ‘Punching Bag’ on the blackboard and asked all of us to come forward and write what came to mind when we read the phrase. I forgot what I wrote, but everyone made literal connections like ‘boxing’. Maybe someone wrote ‘anger’. Then she went to the board and wrote ‘Mother’.</p>

<p>I’ve been meaning to blog my thoughts on this memory for months, but I have a renewed interest as I make the decision to choose whom I love and marry despite my mother (and father’s) disapproval. As the emotional manipulation, blackmail and abuse reaches the worst it’s ever been, I try to remember to not treat my mother as a punching bag. Not for the reason my teacher intended (to always show grace for mothers despite abuse), but because <em>I</em> need to grow and flourish mentally. I understand how my mother is a product of a traumatic society and politics, but socio-political understanding can’t solve personal problems. The personal is political, but solving interpersonal conflict requires people to reflect and change. My mother can’t do that because she doesn’t want to.</p>

<p>A few weeks ago, Sohla El-Waylly blogged about <a href="https://sohla.substack.com/p/sorry-about-your-cookies">her experiences being labelled difficult</a> at work, and I cried reading about her upbringing:</p>

<blockquote>
  <p>I grew up in an extremely conservative family and only interacted with a limited group of people with similar views until I reached adulthood. My life was small and secluded. They were all immigrants from Bangladesh, away from their families and homes in a new, often hostile, country. They did what they thought was protecting us, demanding full control over their children. I never handled it well…It’s an uneasy thing believing that your mother will only love you if you do what she wants. When my parents found out I had a boyfriend in college, they stood outside my dorm room on their knees, crying and screaming, hollering my name…My mother was so embarrassed by my divorce that on my wedding day to Ham she apologized to his father for having to take me, this difficult girl, into his family.</p>
</blockquote>

<p>She understands why her family behaved the way they did, doesn’t excuse it, and marches on. I don’t know how she does it, but I have to as well. Accept that my parents won’t change, will continue to manipulate and hurt me, and that I need to maintain my sanity and try to be there for them as a son. Somehow.</p>]]></content><author><name>Venkat</name></author></entry><entry><title type="html">FizzBuzzing LLMs</title><link href="http://venkatasg.net/blog/fizzbuzz-2025-12-11.html" rel="alternate" type="text/html" title="FizzBuzzing LLMs" /><published>2025-12-11T00:00:00+00:00</published><updated>2025-12-11T00:00:00+00:00</updated><id>http://venkatasg.net/blog/fizzbuzz</id><content type="html" xml:base="http://venkatasg.net/blog/fizzbuzz-2025-12-11.html"><![CDATA[<p><a href="https://www.youtube.com/watch?v=QPZ0pIK_wsc">Tom Scott’s video</a> explaining why FizzBuzz is a good interview question to weed out bad programmers is 8 years old (!) and still a great watch.
I figured it would be a fun exercise to study how various LLMs solve the problem for the same reason Tom gives in the video: it reveals one’s coding style. Do they bodge a solution together? Or do they plan and think long-term?</p>

<p>Here’s the FizzBuzz problem statement (from <a href="https://imranontech.com/2007/01/24/using-fizzbuzz-to-find-developers-who-grok-coding/">a blogpost by Imran Ghory</a> that Tom cites in his video), which I used verbatim when prompting:</p>

<blockquote>
  <p>Write a program that prints the numbers from 1 to 100. But for multiples of 
three print “Fizz” instead of the number and for the multiples of five print 
“Buzz”. For numbers which are multiples of both three and five print 
“FizzBuzz”.</p>
</blockquote>

<p>I ran all of the prompts in the first week of December 2025, and include links to the full chat when possible.</p>

<h2 id="gpt-51">GPT-5.1</h2>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>for i in range(1, 101):
    if i % 15 == 0:
        print("FizzBuzz")
    elif i % 3 == 0:
        print("Fizz")
    elif i % 5 == 0:
        print("Buzz")
    else:
        print(i)
</code></pre></div></div>

<p>This is probably what I would write in response to the question. It’s a decent solution, and probably represents the average programmer’s answer to the question. I like that it’s clean with no comments; there’s no statement or expression so complex that it needs explaining here.</p>

<p><a href="https://chatgpt.com/share/693b3ce3-2ac8-8004-9710-aca554f3ad0b"><span class="sancaps">gpt-5.1 chat</span></a></p>

<h2 id="gemini-3">Gemini 3</h2>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>def fizzbuzz(n_max):
    """
    Prints numbers from 1 up to n_max, with substitutions for multiples
    of 3 ("Fizz"), 5 ("Buzz"), and both ("FizzBuzz").

    Args:
        n_max (int): The maximum number in the range (inclusive).
    """
    print(f"--- FizzBuzz sequence up to {n_max} ---")
    
    # Loop from 1 up to and including n_max
    for n in range(1, n_max + 1):
        output = ""
        
        # Check for multiples of 3
        if n % 3 == 0:
            output += "Fizz"
            
        # Check for multiples of 5
        if n % 5 == 0:
            output += "Buzz"
            
        # If output is still empty, it means n is not a multiple of 3 or 5,
        # so we print the number itself.
        if not output:
            print(n)
        else:
            # If output has "Fizz", "Buzz", or "FizzBuzz", print the string.
            print(output)

# Run the function for the required range (1 to 100)
fizzbuzz(100)
</code></pre></div></div>

<p>I didn’t ask for a function, but Gemini wrote one anyway. Google seems to have focused on training Gemini models (since at least 2.5) to always output code that’s extremely well-commented. I’m surprised there aren’t type hints<sup id="fnref:1" role="doc-noteref"><a href="#fn:1" class="footnote" rel="footnote">1</a></sup> and emoji sprinkled all over the comments and print statements, but I guess it reserves that behavior for more complex coding challenges. Both of these are almost certainly behaviors the model ‘learns’ during human-preference alignment: annotators/users just prefer responses with emojis and lots of comments. I don’t.</p>

<p>It did use string concatenation to solve FizzBuzz though, which is a smarter way to solve the problem than <code class="language-plaintext highlighter-rouge">if-elif-elif-else</code>.</p>

<p><a href="https://gemini.google.com/share/6f67394edae5"><span class="sancaps">gemini chat</span></a></p>

<h2 id="claude-45">Claude 4.5</h2>

<p>Claude Haiku used string concatenation and was thankfully less verbose, like GPT-5.1:</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>for i in range(1, 101):
    output = ""

    if i % 3 == 0:
        output += "Fizz"
    if i % 5 == 0:
        output += "Buzz"
    
    if output == "":
        output = str(i)
    
    print(output)
</code></pre></div></div>

<p><a href="https://claude.ai/share/23c56bac-bdf5-4e00-bbf1-7cf9d39ea4fa"><span class="sancaps">haiku chat</span></a></p>

<p>Claude Sonnet, despite being the larger, smarter model, chose to solve the problem using <code class="language-plaintext highlighter-rouge">if-elif-elif-else</code>:</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>for i in range(1, 101):
    if i % 3 == 0 and i % 5 == 0:
        print("FizzBuzz")
    elif i % 3 == 0:
        print("Fizz")
    ...
</code></pre></div></div>

<p>The responses from GPT-5.1 and Claude were short and plain, and I prefer them both to Gemini. However, both these models do write extensive comments (and emoji) when they are tasked with bigger coding challenges, which is smart product design on OpenAI and Anthropic’s part.</p>

<p><a href="https://claude.ai/share/66f069a1-8fbf-47fe-a991-3de3b6919aeb"><span class="sancaps">sonnet chat</span></a></p>

<h2 id="deepseek-v32">DeepSeek-V3.2</h2>

<p>DeepSeek-V3.2 gave me <em>3</em> different solutions: one using <code class="language-plaintext highlighter-rouge">if-elif-elif-else</code>, another using string concatenation, and this one-liner:</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>for i in range(1, 101): print("Fizz"*(i%3==0) + "Buzz"*(i%5==0) or i)
</code></pre></div></div>

<p>I hate one-liners and code that’s trying to be too cute or smart, but I’ll excuse this. Barely. Multiplying a boolean with a string and then using <code class="language-plaintext highlighter-rouge">or</code> to fall back to an integer… I think this is Python’s fault for allowing such a monstrosity.</p>
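
<p>To spell out why it even works: <code class="language-plaintext highlighter-rouge">True</code> and <code class="language-plaintext highlighter-rouge">False</code> act as 1 and 0 under string repetition, and <code class="language-plaintext highlighter-rouge">or</code> returns its first truthy operand, so the empty string falls through to the number:</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code># How the expression evaluates for a few values of i:
# i = 9:  "Fizz"*True  + "Buzz"*False -> "Fizz"; "Fizz" or 9 -> "Fizz"
# i = 15: "Fizz"*True  + "Buzz"*True  -> "FizzBuzz"
# i = 7:  "Fizz"*False + "Buzz"*False -> "";     ""     or 7 -> 7
print("Fizz"*(9%3==0) + "Buzz"*(9%5==0) or 9)    # Fizz
print("Fizz"*(7%3==0) + "Buzz"*(7%5==0) or 7)    # 7
</code></pre></div></div>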

<p><a href="https://chat.deepseek.com/share/o7zcy9v98gf3m9dysj"><span class="sancaps">deepseek chat</span></a></p>

<h2 id="what-did-i-learn">What did I learn?</h2>

<p>A student in my introductory Python class asked if there were any accuracy differences for code generated by different AIs, and I said no, but there were differences in <em>style</em>. After this little exercise and more experience with coding agents, I’m not sure style is the right word. The code samples above are stylistically different, but the differences come from design decisions by the companies to boost engagement.</p>

<p>Gemini adds extensive comments because Google thinks you’ll like that code more and it will get you to use Gemini more. DeepSeek generates 3 solutions probably for the same reason. Claude and GPT-5.1 are more adaptive in their responses, but they exhibit similar engagement-boosting tactics<sup id="fnref:2" role="doc-noteref"><a href="#fn:2" class="footnote" rel="footnote">2</a></sup>. Ultimately, I think this exercise gives a peek into what the latest LLMs from these companies actually are: products designed with the purpose of increasing ‘engagement’. They just want us to keep using them.</p>

<p><strong>Update</strong>: I took it one step further and made a benchmark out of <a href="https://github.com/venkatasg/fizzbuzz-llm">LLMs playing FizzBuzz</a>. Some mildly interesting results!</p>

<!-- Footnotes -->
<div class="footnotes" role="doc-endnotes">
  <ol>
    <li id="fn:1" role="doc-endnote">
      <p>Type hints in Python <a href="https://www.uninformativ.de/blog/postings/2022-04-21/0/POSTING-en.html">don’t do much</a>. <a href="#fnref:1" class="reversefootnote" role="doc-backlink">&#8617;</a></p>
    </li>
    <li id="fn:2" role="doc-endnote">
      <p>ChatGPT almost always ends its response with a follow-up question, or a cheery sentence about how it can do something else. Genius, but also insidious. <a href="#fnref:2" class="reversefootnote" role="doc-backlink">&#8617;</a></p>
    </li>
  </ol>
</div>]]></content><author><name>Venkat</name></author></entry><entry><title type="html">Levels of nuance never reached</title><link href="http://venkatasg.net/blog/nuance-2025-09-20.html" rel="alternate" type="text/html" title="Levels of nuance never reached" /><published>2025-09-20T00:00:00+00:00</published><updated>2025-09-20T00:00:00+00:00</updated><id>http://venkatasg.net/blog/nuance</id><content type="html" xml:base="http://venkatasg.net/blog/nuance-2025-09-20.html"><![CDATA[<p>I’ve <em>finally</em> published <a href="https://arxiv.org/abs/2505.17390">a paper</a> where I could cite KJ Healy’s provocative 2012 paper <a href="https://kieranhealy.org/publications/fuck-nuance/">‘Fuck Nuance’</a>. It’s not the most technically sophisticated paper I’ve worked on, nor the most intellectually interesting, but as soon as I saw all those papers about ‘fine-grained’ synthetic personas, I knew that this was probably an instance of researchers falling into the ‘fine-grained nuance trap’ and that I needed to look into it with my collaborators Chantal and Gauri. Our findings aren’t conclusive across all models and settings, but in our specific experimental conditions fine-grained detail in personas doesn’t dramatically improve the lexical diversity of synthetic data from LLMs.</p>

<p>The paper is a short read; I hope some people find value in it. But I wanted to use the paper as an excuse to revisit my <a href="https://venkatasg.net/blog/nuance-fucks-2020-08-03.html">very first post</a> on this blog. It’s been five years: has the phenomenon of <em>nuance rising</em> slowed down in ACL publications?</p>

<div class="img-container">
<a href="/files/blog/nuance-fucks/2025.png">
<img src="/files/blog/nuance-fucks/2025.png" alt="Graph showing an increasing proportion of articles in the ACL anthology
         containing the words 'nuance' or 'fine-grain' in the title/abstract, with an explosion in the last 10 years." />
</a>
</div>

<p>Nope. At the current rate of doubling, over half the papers published in *CL venues will mention nuance/fine-grain by 2040! I can’t wait to write the follow-up to this post in 2030 to see if we’ve reached peak nuance.</p>

<p>Again, I don’t hold a value judgement one way or another, and at least 2 of my articles contribute to this phenomenon. I think words like ‘nuance’ and ‘fine-grain’ are just words we attribute high value to as researchers and use even when we don’t need to. Words also fall into and out of style — but I do think a lot of us also fall into the nuance trap when we study research questions to make our papers sound more appealing.</p>]]></content><author><name>Venkat</name></author></entry><entry><title type="html">Kural embeddings</title><link href="http://venkatasg.net/blog/kural-2025-08-23.html" rel="alternate" type="text/html" title="Kural embeddings" /><published>2025-08-23T00:00:00+00:00</published><updated>2025-08-23T00:00:00+00:00</updated><id>http://venkatasg.net/blog/kural</id><content type="html" xml:base="http://venkatasg.net/blog/kural-2025-08-23.html"><![CDATA[<p>Over the last two years, I’ve seen <a href="https://hn.algolia.com/?dateRange=pastYear&amp;page=0&amp;prefix=false&amp;query=embeddings&amp;sort=byPopularity&amp;type=story">quite a lot of blog posts, videos and explainers</a> about embeddings, which initially surprised me. It shouldn’t have, since ‘AI’/LLMs are <em>everywhere</em> now; developers, students and hobbyists are really excited (as I assume they were about the Web in the late 90s/early 2000s and mobile apps in the early 2010s) to understand them and build something using them. Still, it feels surreal for the concept I studied as a Master’s student to be the focus of attention for the whole computing industry. There’s too much attention and hype, and I do wonder (with some fear) what a <a href="https://www.wheresyoured.at/the-haters-gui/">massive financial bubble bursting</a> actually looks like at the micro level in everyday life.</p>

<p>One undeniable positive to come from all the hype around ‘AI’ is much better developer tooling and frameworks for embeddings and neural networks now than in 2017. With <a href="https://huggingface.co/Qwen/Qwen3-Embedding-0.6B">Qwen3 multilingual embeddings</a> and <a href="https://github.com/google-gemini/gemini-cli">Gemini CLI</a><sup id="fnref:1" role="doc-noteref"><a href="#fn:1" class="footnote" rel="footnote">1</a></sup>, I could quickly prototype the first idea that popped into my head: build a web app that uses a multilingual embedding model to find relevant Thirukural couplets (in Tamil) for user queries in any language. That’s what <a href="https://huggingface.co/spaces/venkatasg/kural">‘Kural for your question’</a> is. I’m pretty happy with the end product, but the retrieval of relevant kural couplets itself, done via cosine similarity of embeddings, is pretty underwhelming.</p>
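
<p>The retrieval step itself is only a few lines. A minimal sketch with <code class="language-plaintext highlighter-rouge">sentence-transformers</code> (the query prompt follows the Qwen3-Embedding model card; the couplet texts here are stand-ins for the real 1330):</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>from sentence_transformers import SentenceTransformer

# Embed every kural once, then score queries against them.
model = SentenceTransformer('Qwen/Qwen3-Embedding-0.6B')

kurals = ['kural couplet 1...', 'kural couplet 2...']   # stand-in texts
kural_embs = model.encode(kurals)                        # documents: no prompt

query = 'What are the duties of a son to his parents?'
query_emb = model.encode([query], prompt_name='query')   # queries use a prompt

# Cosine similarity between the query and every kural; highest scores win.
scores = model.similarity(query_emb, kural_embs)
print(scores)
</code></pre></div></div>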

<p>Visualizing the embeddings of all 1330 kurals in 2D using UMAP gives an idea why. There aren’t really any meaningful clusters, with only the <em>Book of Love</em> showing some clear separation from the other two books. The similarity search works sometimes because of certain key words, but lacks any understanding of the intent of the question. For the question <em>What are the duties of a son to his parents?</em> (<a href="https://venkatasg.net/blog/memory-2023-08-25.html">an undying question for me</a>), only one of the 3 kurals deemed relevant is about the parent-child relationship at all. The larger Qwen3 embedding models might work better, but model training frameworks and data mixtures are more biased towards real-world use cases — my niche, little idea probably doesn’t mesh well with what <a href="https://arxiv.org/pdf/2506.05176">Qwen3 Embedding was trained to do</a>.</p>

<div class="img-container">
<a href="/files/blog/kural/umap_projection.png">
<img src="/files/blog/kural/umap_projection.png" alt="UMAP projection of all Thirukural embeddings from Qwen3-Embedding-0.6B colored by book." />
</a>
</div>
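
<p>The projection above is the standard UMAP recipe; a sketch, assuming the kural embeddings from the snippet earlier and <code class="language-plaintext highlighter-rouge">umap-learn</code> installed:</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>import umap
import matplotlib.pyplot as plt

# Project the kural embeddings down to 2D for plotting.
# kural_embs is the array computed in the retrieval sketch above;
# the actual figure also colors each point by its book.
reducer = umap.UMAP(n_components=2, random_state=42)
coords = reducer.fit_transform(kural_embs)

plt.scatter(coords[:, 0], coords[:, 1], s=5)
plt.title('UMAP projection of Thirukural embeddings')
plt.show()
</code></pre></div></div>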

<p>Still feels good to build something, even if underwhelming 🙂.</p>

<!-- Footnotes -->
<div class="footnotes" role="doc-endnotes">
  <ol>
    <li id="fn:1" role="doc-endnote">
      <p>Just like <a href="https://venkatasg.net/typeproof">Typeproof</a> — I know it’s dangerous, but boy is it addictive to build fully functional web apps with just text prompts. I’ve learned more TypeScript this way than I have in years, but not as much as I would have learned if I had built these apps from scratch. But I would have never built these web apps from scratch either. <a href="#fnref:1" class="reversefootnote" role="doc-backlink">&#8617;</a></p>
    </li>
  </ol>
</div>]]></content><author><name>Venkat</name></author></entry><entry><title type="html">Dangers of the translation period</title><link href="http://venkatasg.net/blog/translation-2025-07-11.html" rel="alternate" type="text/html" title="Dangers of the translation period" /><published>2025-07-11T00:00:00+00:00</published><updated>2025-07-11T00:00:00+00:00</updated><id>http://venkatasg.net/blog/translation</id><content type="html" xml:base="http://venkatasg.net/blog/translation-2025-07-11.html"><![CDATA[<p>I recently watched <a href="https://letterboxd.com/film/fallen-leaves-2023/">Fallen Leaves</a> on a flight to Chennai. What a wonderful movie — depressed people talking about love and life in a depressed manner, and yet finding their peace in the end.</p>

<p>A few lines of dialogue caught my attention towards the end of the movie. The main character is solving a crossword puzzle when she utters the meta-linguistic clues out loud:</p>

<blockquote>
  <p>“Danger.” Six letters.<br />Threat.<br />“O positive.” Ten letters.<br />Blood group.</p>
</blockquote>

<p>The above lines are from the English subtitles that I was reading, but Fallen Leaves is a <em>Finnish</em> movie. I wondered how the subtitle translators <a href="https://www.google.co.in/books/edition/Experiences_in_Translation/0dVYaP9VukIC?hl=en&amp;gbpv=0">shifted between cultures</a> when writing the English subtitles for these language-specific clues, so I naturally looked up the original Finnish subtitles:</p>

<blockquote>
  <p>“Vaara.” Kuusi kirjainta.<br />Uhka.<br />“O Positiivinen.” Kymmenen kirjainta.<br />Veriryhmä.</p>
</blockquote>

<p><em>Kuusi</em> and <em>Kymmenen</em> are the cardinal numbers six and ten <a href="https://en.wikipedia.org/wiki/Finnish_numerals">in Finnish</a>. But the Finnish word for blood group, <em>Veriryhmä</em> according to the subtitles (and <a href="https://translate.google.com/?sl=en&amp;tl=fi&amp;text=blood%20group&amp;op=translate">Google Translate</a>), is 9 characters long. However, <a href="https://en.wikipedia.org/wiki/Finnish_grammar#Plurals">Finnish plurals</a> are frequently formed with the <em>-t</em> suffix, and the captions might have missed the plural, or the character might have simply said the singular. But what about <em>Uhka</em>, which is just 4 characters?</p>

<p>It is at this point while writing this blog post that I realized she’s solving an English crossword puzzle, and just reading the clues out loud in Finnish 🤦🏾‍♂️. So much for getting excited about an interesting translation puzzle in the wild.</p>]]></content><author><name>Venkat</name></author></entry><entry><title type="html">Concurrency in DeTeXt</title><link href="http://venkatasg.net/blog/concurrency-2025-07-05.html" rel="alternate" type="text/html" title="Concurrency in DeTeXt" /><published>2025-07-05T00:00:00+00:00</published><updated>2025-07-05T00:00:00+00:00</updated><id>http://venkatasg.net/blog/concurrency</id><content type="html" xml:base="http://venkatasg.net/blog/concurrency-2025-07-05.html"><![CDATA[<p>I haven’t had a traditional formal CS education, which hasn’t held me back from getting things done with computer programming. However, I have slowly accumulated a long list of blind spots because of the high level of abstraction modern scripting languages like Python provide. One of these concepts is <strong>concurrency</strong>. This blog post will be very high-level, but it’s my understanding of these difficult concepts at the moment, and it did help me <em>implement</em> Swift <a href="https://docs.swift.org/swift-book/documentation/the-swift-programming-language/concurrency/">concurrency</a> in my app <a href="https://venkatasg.net/apps/detext">DeTeXt</a>.<sup id="fnref:1" role="doc-noteref"><a href="#fn:1" class="footnote" rel="footnote">1</a></sup></p>

<h3 id="sky-high-overview">Sky-high overview</h3>

<p>The first misconception I had to forego was the link between concurrency and <em>parallelism</em>. Rob Pike describes the distinction in a succinct way:</p>

<blockquote>
  <p>Concurrency is about <strong>dealing with</strong> multiple things at once. Parallelism is about <strong>doing</strong> multiple things at once.</p>
</blockquote>

<p>The <em>dealing with</em> part is important! I didn’t understand the need for this because of how removed I’ve been from low-level programming. YouTuber Core Dumped has a really good video about concurrency, and his explanation finally helped me understand why concurrency is not only important, but crucially <strong>the default</strong>. Processors execute one instruction at a time — but modern operating systems run hundreds of active processes at any given time. The Operating System (OS) is in charge of figuring out which process should get access to system resources. Modern CPUs are so fast that we’re under the illusion that all processes are running simultaneously.<sup id="fnref:2" role="doc-noteref"><a href="#fn:2" class="footnote" rel="footnote">2</a></sup></p>

<p>This fundamental truth about <em>dealing with multiple things that need to get done</em> applies to individual applications/programs as well. Thankfully, application developers can rely on the APIs and abstractions provided by the OS to handle concurrency. For developing applications on Apple platforms in 2025, this means adopting language support for concurrency in Swift.</p>

<p>For my simple app <a href="https://venkatasg.net/apps/detext">DeTeXt</a>, there were two instances where I needed to deal with concurrency in my code:</p>

<ul>
  <li>running image recognition with my CoreML model as a separate task on the main thread.</li>
  <li>showing a temporary toast-style pop-up when the user copies a symbol/command/unicode code-point. Once again, this is a task that runs on the main thread.</li>
</ul>

<p>But what’s a thread? Or a task? And why am I running things on the main thread?</p>

<h3 id="threads">Threads</h3>

<p>A process is a program in execution. It has its own program counter, register information, and memory space. However, programs themselves often need to do multiple things at once. Threads are the abstraction that almost all OSes have settled on to handle concurrency within a process. Every process has a <em>main</em> thread, which is the initial thread where all work first happens. The OS typically initiates subsequent threads based on developer instructions.</p>

<p>For a user-facing mobile application the cardinal rule is that <strong>all user interface (UI) work must happen on the main thread</strong>. Updating the user interface is a short-term operation that immediately affects the user experience; users can tolerate waiting for a large download or file save, but the app itself should never crash or lag. Critical operations happen on the main thread, which has OS-level priority for the application.</p>

<p>DeTeXt is a pretty simple app — it doesn’t download or upload anything, all symbols and images are loaded on start-up, and they’re only 10MB anyway. Implementing concurrency in DeTeXt simply meant identifying (and marking) <strong>asynchronous functions</strong> and possible suspension points (where long-running work can occur), and instructing the OS to run all asynchronous functions on the main thread, which it was doing anyway.</p>

<h3 id="actors-and-tasks">Actors and Tasks</h3>

<p>Swift’s concurrency model doesn’t let us work with threads directly. Instead we work with 2 abstractions — tasks and actors.</p>

<p>A unit of work that we need to handle with concurrency is a task. Fetching web resources and reading/writing from the file system are traditional examples of tasks suited to concurrent processing. For DeTeXt, I defined 2 tasks that encapsulated the 2 asynchronous functions<sup id="fnref:3" role="doc-noteref"><a href="#fn:3" class="footnote" rel="footnote">3</a></sup> mentioned earlier:</p>

<ul>
  <li>Taking the drawing from the on-screen canvas, pre-processing it, and processing it through the neural net that calculates probabilities for every symbol. The probabilities are stored in a reference type marked <a href="https://developer.apple.com/documentation/observation">Observable</a> — any changes to the underlying data send notifications to SwiftUI views that observe it.</li>
  <li>Displaying the name of the command, or the symbol itself and showing it as a toast-style pop-up on screen for a set, constant period of time, then automatically dismissing it.</li>
</ul>

<p>Now both of these need to run on the main thread, since they update the UI in both cases. However, both tasks can take undetermined amounts of time to finish. The CPU/GPU/Neural engine might be clogged up doing some other intensive process (very unlikely but possible), and the toast task needs to pause for a set amount of time before finishing. Implementing this in DeTeXt couldn’t have been simpler: package the <a href="https://github.com/venkatasg/DeTeXt/blob/cc42051fddfbfcf439ddc42ebe69cedcf591702c/Shared/PKCanvas.swift#L48">asynchronous function call to the CoreML model</a> and <a href="https://github.com/venkatasg/DeTeXt/blob/cc42051fddfbfcf439ddc42ebe69cedcf591702c/iOS/CopyToastView.swift#L39">the toast suspension/sleep</a> as tasks. Now, we need to ensure that both tasks run on the main thread, for which we turn to another abstraction: Actors.</p>

<p>Actors are objects that ensure that only one function has access to mutable data at a time. Their role in concurrency is to avoid <a href="https://en.wikipedia.org/wiki/Race_condition#Data_race">data races</a>. The Main Actor is in charge of updating all the data that updates the UI. Since the only mutable data in my app pertains to the UI, I simply needed to mark the two asynchronous functions with the <code>@MainActor</code> attribute, instructing the OS that while there may be suspension points in these two functions, both of them impact the UI, so ensure that these functions only run on the Main Actor. The Main Actor abstracts over the main thread — they’re very similar, and the differences <a href="https://docs.swift.org/swift-book/documentation/the-swift-programming-language/concurrency/#The-Main-Actor">have more to do with low-level implementation details</a>.</p>
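
<p>For fellow Python-first programmers, a loose <code class="language-plaintext highlighter-rouge">asyncio</code> analogue of the toast task might make the shape of this clearer: the single-threaded event loop plays the role of the main thread, a task wraps an async function, and <code class="language-plaintext highlighter-rouge">await asyncio.sleep()</code> is the suspension point. This is an analogy, not the app’s Swift code; that lives in the repo.</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>import asyncio

# Loose Python analogue of the toast task: show a pop-up, suspend for
# a set duration without blocking the event loop, then dismiss.
async def show_copy_toast(message):
    print('toast shown:', message)   # stand-in for updating the UI
    await asyncio.sleep(2)           # suspension point: the loop stays free
    print('toast dismissed')         # stand-in for hiding the pop-up

async def main():
    # Package the coroutine as a task, like Swift's Task { ... }
    toast = asyncio.create_task(show_copy_toast('\\alpha copied'))
    await toast

asyncio.run(main())
</code></pre></div></div>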

<h3 id="fin">Fin</h3>

<p>I learned a lot about concurrency and asynchronous functions while re-writing my app to use Swift’s new concurrency features. You can view the actual code changes on <a href="https://github.com/venkatasg/DeTeXt">the GitHub repo</a>, of course. To be honest, writing this blog post took more time than actually learning and implementing Swift concurrency in my app! I love explainer blog posts and videos, so I figured I’d take a shot at writing my own, for me.</p>

<!-- Links and footnotes -->
<div class="footnotes" role="doc-endnotes">
  <ol>
    <li id="fn:1" role="doc-endnote">
      <p>I didn’t <em>need</em> to add concurrency as it turned out, but it was a good excuse to learn the underlying concepts. <a href="#fnref:1" class="reversefootnote" role="doc-backlink">&#8617;</a></p>
    </li>
    <li id="fn:2" role="doc-endnote">
      <p>On CPUs with multiple cores, multiple processes do run simultaneously. But even the beefiest CPU from Apple has ‘only’ <a href="https://everymac.com/systems/apple/mac-studio/specs/mac-studio-m3-ultra-32-core-cpu-80-core-gpu-2025-specs.html">32 cores</a>. I had 869 active processes when I was writing this post. <a href="#fnref:2" class="reversefootnote" role="doc-backlink">&#8617;</a></p>
    </li>
    <li id="fn:3" role="doc-endnote">
      <p>Asynchronous functions run as part of some task — the task abstraction enables structured concurrency, which I haven’t explored yet. <a href="#fnref:3" class="reversefootnote" role="doc-backlink">&#8617;</a></p>
    </li>
  </ol>
</div>]]></content><author><name>Venkat</name></author></entry><entry><title type="html">Pride hiding prejudice</title><link href="http://venkatasg.net/blog/pride-2025-01-25.html" rel="alternate" type="text/html" title="Pride hiding prejudice" /><published>2025-01-25T00:00:00+00:00</published><updated>2025-01-25T00:00:00+00:00</updated><id>http://venkatasg.net/blog/pride</id><content type="html" xml:base="http://venkatasg.net/blog/pride-2025-01-25.html"><![CDATA[<p>“How many of you are actually proud of being Indian?”</p>

<p>Dutifully, a few others and I raised our hands to this pointed question from our social sciences teacher. I believe we were in ninth or tenth standard, so in our mid-teens. Her reply to the handful of raised hands was just as biting — <em>I don’t believe you</em>.</p>

<p>She was right, at least about us not being truly proud. We raised our hands because we knew it was the right thing to do, not because we were truly proud of our country. I couldn’t have described with any words what ‘being proud of your country’ meant at that time. There were empty quotes and calls to duty, but duty to whom? My recollection was that we were taught to serve our country by not questioning authority or tradition. Pride in one’s country, as my social science teacher (and most adults around me at the time) understood it, precludes questions, criticisms and introspection of cultural practices. But pride that requires conformance, fear or unquestioning respect for authority is not worth having. Considering the strict, authoritarian and traditional environment of our school, where our teacher was an active participant, I’ve no doubt this was the pride she sought. I know now that I want no part of this pride.</p>

<p>She wouldn’t believe me today if I said I was proud of my country, because we diverge on what pride means. I take pride in the people in India and around the world who question authority, injustice and fight for freedom from suffocating power structures. I’m proud of people who try to interrogate their privilege and try to <a href="https://chomsky.info/20130706/">unlearn their indifference to suffering</a>. One day, I hope to be truly proud of my country and myself, as I do the necessary, hard work that is required of anyone walking down this path.</p>]]></content><author><name>Venkat</name></author><summary type="html"><![CDATA[“How many of you are actually proud of being Indian?” Dutifully, me and a few others raised our hands to this pointed question by our social sciences teacher. I believe we were in ninth or tenth standard, so in our mid-teens. Her reply to the handful of raised hands was just as biting — I don’t believe you. She was right; at least, about us not being truly proud. We raised our hands because we knew it was the right thing to do, not because we were truly proud of our country. I couldn’t have described with any words what ‘being proud of your country’ meant at that time. There were empty quotes and calls to duty, but duty to whom? My recollection was that we were taught to serve our country by not questioning authority or tradition. Pride in one’s country, as my social science teacher (and most adults around me at the time) understood it, precludes questions, criticisms and introspection of cultural practices. But pride that requires conformance, fear or unquestioning respect of authority is not worth having. Considering the strict, authoritarian and traditional environment of school where our teacher was an active participant, I’ve no doubt this was the pride she sought. I know now that I want no part of this pride. She wouldn’t believe me today if I said I was proud of my country, because we diverge on what pride means. I take pride in the people in India and around the world who question authority, injustice and fight for freedom from suffocating power structures. I’m proud of people who try to interrogate their privilege and try to unlearn their indifference to suffering. One day, I hope to be truly proud of my country and myself, as I do the necessary, hard work that is required of anyone walking down this path.]]></summary></entry><entry><title type="html">Developer’s Cut</title><link href="http://venkatasg.net/blog/cut-2025-01-11.html" rel="alternate" type="text/html" title="Developer’s Cut" /><published>2025-01-11T00:00:00+00:00</published><updated>2025-01-11T00:00:00+00:00</updated><id>http://venkatasg.net/blog/cut</id><content type="html" xml:base="http://venkatasg.net/blog/cut-2025-01-11.html"><![CDATA[<p>I was watching <a href="https://www.youtube.com/watch?v=D6OT77T7YlE&amp;t=2591s">HBomberguy’s video on Director’s cuts</a> a few days ago, when my mind discovered a connection to something else I’d read, as it tends to do when I’m marinating on the couch after dinner. Hbomberguy references an article <a href="https://cinefiles.bampfa.berkeley.edu/catalog/57781">‘Uncertain Glory’ by Greg Solman</a>, published in <em>Film Comment</em> magazine, 1993. The article is a terrific read; here’s my favorite passage (and most relevant to my larger point):</p>

<blockquote>
  <p>Viewing film art as perpetually subject to update and correction, political or other, says something about the director’s relationship to his work…Director’s cuts are both futile and wrong-headed. In a way, they redress artistic grievances by treating movies as <strong>mere product</strong>. In this country, an inalienable right to change as an artist doesn’t extend to changing art and, therefore, art history.</p>
</blockquote>

<p>I don’t think the argument is that Director’s cuts should be outlawed as a concept — rather, treating cinema as subject to revision and change (which can be marketed) has consequences for the creative pursuit of film-making and how the audience responds to the art-form. Which brings me to <a href="https://mastodon.social/@gruber/113590076151525762">this reply by John Gruber to Ken Kocienda’s post on Mastodon</a>:</p>

<blockquote>
  <p>For as long as I’ve known him — and I’ve known him going on, jeez, almost 25 years now — @brentsimmons has said his favorite part of programming is <strong>deleting code</strong>. I think that’s perhaps the fundamental thing about programming that makes it unique as an art form. No other creative medium I can think of has that quality.</p>
</blockquote>

<p>While this reply is about deleting code, Kocienda’s original post describes the way most (good) software developers view their code. We learn not to grow attached to our code, and to focus on constant updates and refinement. But what does this say about our relationship to our work, the question Solman raised about Director’s cuts?</p>

<p>I’m not going to argue that programming isn’t art; I’ve already written about how the <em>Is this art?</em> argument <a href="https://venkatasg.net/blog/gamey-2022-10-09.html">isn’t very interesting</a>. However, two things are true at once:</p>

<ol>
  <li>Programming is fundamentally revisionist — you have to maintain and update your code-base or, sooner or later, it will stop working. There are always bugs and inefficiencies to fix. For a software project, not being actively maintained means death. Programming might have artistic qualities and it is definitely a creative pursuit — but the intended product of your program has to work and be stable for some time to be useful.</li>
  <li>However, the fact that <strong>programming is fundamentally a revisionist creative pursuit</strong> means there are consequences to embracing it whole-heartedly. Big Tech releases <a href="https://sixcolors.com/post/2025/01/apple-intelligence-summaries-might-get-warning-labels-thats-not-enough/">half-baked software all the time under the <em>Beta</em> label</a>. Video-game companies release unfinished video games and market them as <em>Early access</em>. Just because software is amenable to updates and refinement doesn’t mean our standards for good software releases should go down, but that’s where we are now.</li>
</ol>

<p>Drawing a connection between the unintended consequences of Director’s cuts and revising code gives us a new lens to understand how we approach building software, but it shouldn’t be taken as a directive to update or revise code less. I think knowledge of the unintended consequences of the nature of one’s creative pursuit or medium is valuable in and of itself. It helps us make better decisions when writing code and building software.</p>

<!-- Links and footnotes -->]]></content><author><name>Venkat</name></author><summary type="html"><![CDATA[I was watching HBomberguy’s video on Director’s cuts a few days ago, when my mind discovered a connection to something else I’d read, as it tends to do when I’m marinating on the couch after dinner. Hbomberguy references an article ‘Uncertain Glory’ by Greg Solman, published in Film Comment magazine, 1993. The article is a terrific read; here’s my favorite passage (and most relevant to my larger point): Viewing film art as perpetually subject to update and correction, political or other, says something about the director’s relationship to his work…Director’s cuts are both futile and wrong-headed. In a way, they redress artistic grievances by treating movies as mere product. In this country, an inalienable right to change as an artist doesn’t extend to changing art and, therefore, art history. I don’t think the argument is that Director’s cuts should be outlawed as a concept — rather, treating cinema as subject to revision and change (which can be marketed) has consequences for the creative pursuit of film-making and how the audience responds to the art-form. Which brings me to this reply by John Gruber to Ken Kocienda’s post on Mastodon: For as long as I’ve known him — and I’ve known him going on, jeez, almost 25 years now — @brentsimmons has said his favorite part of programming is deleting code. I think that’s perhaps the fundamental thing about programming that makes it unique as an art form. No other creative medium I can think of has that quality. While this reply is about deleting code, Kocienda’s original post describes the way most (good) software developers view their code. We learn not to grow attached to our code, and to focus on constant updates and refinement. But what does this say about our relationship to our work, as Solman raised about Director’s cuts? I’m not going to argue that programming isn’t art; I’ve already written about how the Is this art? argument isn’t very interesting. However, two things are true at once: Programming is fundamentally revisionist — you have to maintain and update your code-base or it usually won’t work sooner or later. There’s always bugs and inefficiencies to fix. A software project not being actively maintained means death. Programming might have artistic qualities and it is definitely a creative pursuit — but the intended product of your program has to work and be stable for some time to be useful. However, the fact that programming is fundamentally a revisionist creative pursuit means there are consequences to embracing it whole-heartedly. Big Tech releases half-baked software all the time under the Beta label. Video-game companies release unfinished video games and market it as Early access. Just because software is amenable to updates and refinement doesn’t mean our standards for good software releases should go down, but there’s where we are now. Drawing a connection between the unintended consequences of Director’s cuts and revising code gives us a new lens to understand how we approach building software, but it shouldn’t be taken as a directive to update or revise code less. I think knowledge of the unintended consequences of the nature of one’s creative pursuit or medium is valuable in of itself. 
It helps us make better decisions when writing code and building software.]]></summary></entry><entry><title type="html">Friction in software design and interaction</title><link href="http://venkatasg.net/blog/friction-2025-01-04.html" rel="alternate" type="text/html" title="Friction in software design and interaction" /><published>2025-01-04T00:00:00+00:00</published><updated>2025-01-04T00:00:00+00:00</updated><id>http://venkatasg.net/blog/friction</id><content type="html" xml:base="http://venkatasg.net/blog/friction-2025-01-04.html"><![CDATA[<p>I’ve been reflecting on the topic of friction in software design and interfaces for a while now. I believe it started when I read this passage from the paper <a href="https://dl.acm.org/doi/10.1145/3649468">“Envisioning Information Access Systems: What Makes for Good Tools and a Healthy Web?”</a> by Chirag Shah and Emily Bender:</p>

<blockquote>
  <p>Most systems are designed with the idea that they are supporting users who prefer to put in as little effort as possible when accessing information. This is certainly a desired characteristic of an IA system. However, we argue that it is not always advisable to minimize this effort, especially if it takes away the ability for a user to learn, for instance due to lack of transparency. <strong>We believe, and as <a href="https://papers.ssrn.com/sol3/papers.cfm?abstract_id=4145790">others have supported</a>, that certain amount of ‘friction’ in an IA process is a good thing to have.</strong></p>
</blockquote>

<p>The paper’s focus is on search and information access, but one could extrapolate it to any user-facing technology, and I’ve been especially concerned by its implications for software and user-interface design. The paper they cite by Jeremy Pickens is a good read as well. However, I don’t think either paper truly captures the scale of the problem, which precedes AI.</p>

<p>Friction is impressed upon developers and software designers as something to avoid and design away first. At least, that’s how I understood it (even though I hadn’t mapped the lexical item <em>friction</em> to this concept) when I learned programming and app development by myself. Formally trained UI/UX designers, or those who’ve worked in the area for years, clearly <a href="https://www.smashingmagazine.com/2018/01/friction-ux-design-tool/">realize its utility and value</a>. However, I’d contend that eliminating friction, <a href="https://hypercritical.fireside.fm/19">not making users read/think/wait</a>, and designing easy-to-use interfaces that delight users is the overwhelming default mode of dealing with friction in software design.</p>

<p><a href="https://www.theverge.com/24158374/google-ceo-sundar-pichai-ai-search-gemini-future-of-the-internet-web-openai-decoder-interview">The Verge’s interview of Google CEO Sundar Pichai</a> back in May 2024 exemplifies this thinking. At several points, Pichai acknowledges the trade-offs inherent in providing users with instant LLM-generated answers — but that hasn’t stopped Google from stuffing AI answers and summaries <a href="https://bsky.app/profile/venkatasg.net/post/3l63inzavmz2t">everywhere</a>. Choosing not to include AI summaries, or even introducing a little friction into the process, would promote a healthy web ecosystem and have other social and individual benefits in the long term (as Shah &amp; Bender argue in their paper). But someone else would just do it and threaten Google’s dominant position in tech. The <em>Big</em> in <em>Big Tech</em> isn’t just a modifier — it’s fundamental to what these companies are and how they operate.</p>

<p>Perhaps the default way of conceptualizing friction in software design served users and society well until the early 2010s. But the smartphone, ubiquitous internet, social media, and now AI have changed how I think about it. Sometimes you have to frustrate individual users for the well-being of the community. Unfortunately, the default path is heavily incentivized, since there’s a lot of potential money (and power) at the end.</p>

<!-- Links and footnotes -->]]></content><author><name>Venkat</name></author><summary type="html"><![CDATA[I’ve been reflecting on the topic of friction in software design and interfaces for a while now. I believe it started when I read this passage from the paper “Envisioning Information Access Systems: What Makes for Good Tools and a Healthy Web?” by Chirag Shah and Emily Bender: Most systems are designed with the idea that they are supporting users who prefer to put in as little effort as possible when accessing information. This is certainly a desired characteristic of an IA system. However, we argue that it is not always advisable to minimize this effort, especially if it takes away the ability for a user to learn, for instance due to lack of transparency. We believe, and as others have supported, that certain amount of ‘friction’ in an IA process is a good thing to have. The paper’s focus is on search and information access, but one could extrapolate it to any user-facing technology, and I’ve been especially concerned by its implications towards software and user-interface design. The paper they cite by Jeremy Pickens is a good read as well. However, I don’t think either paper truly captures the scale of the problem, which precedes AI. Friction is impressed upon developers and software designers as something to avoid and design away first. At least, that’s how I understood it (even though I hadn’t mapped the lexical item friction to this concept) when I learned programming and app development by myself. Formally trained UI/UX designers, or those who’ve worked in the area for years clearly realize its utility and value. However, I’d contend that eliminating friction, not making users read/think/wait, and designing easy-to-use interfaces that delight users is the overwhelming default mode of dealing with friction in software design. The Verge’s interview of Google CEO Sundar Pichai back in May 2024 exemplifies this thinking. Pichai acknowledges the trade-offs inherent in providing users with instant LLM-generated answers at several points — but that hasn’t stopped them from stuffing AI answers and summaries everywhere. Choosing not to include AI summaries, or even introducing the least friction into the process, would promote a healthy web ecosystem and have other social and individual benefits in the long term (as Shah &amp; Bender argue in their paper). But someone else would just do it and threaten Google’s dominant position in tech. The Big in Big Tech isn’t just a modifier — it’s fundamental to what these companies are and how they operate. Perhaps the default way of conceptualizing friction in software design served users and society well until the early 2010s. But the smartphone, ubiquitous internet, social media, and now AI have changed how I think about it. Sometimes you have to frustrate individual users for the well being of the community. Unfortunately, the default path is heavily incentivized, since there’s a lot of potential money (and power) at the end.]]></summary></entry><entry><title type="html">What have I done?</title><link href="http://venkatasg.net/blog/made-2024-10-13.html" rel="alternate" type="text/html" title="What have I done?" /><published>2024-10-13T00:00:00+00:00</published><updated>2024-10-13T00:00:00+00:00</updated><id>http://venkatasg.net/blog/made</id><content type="html" xml:base="http://venkatasg.net/blog/made-2024-10-13.html"><![CDATA[<div class="img-container">
    <picture>
        <a href="/files/blog/i-made-this/what-have-i-done.jpg">
            <img src="/files/blog/i-made-this/what-have-i-done.jpg" style="width:100%;" alt="Alec Guiness at the end of 'The Bridge on the River Kwai' about to utter the words 'What have I done?'" />
        </a>
    </picture>
</div>

<p>It’s been a while since I’ve written any blog posts. When I think about my heroes, a few names come to mind: Christopher Hitchens, Jessica Valenti, Chomsky, Ambedkar, and Tom Scott. I see the failings of some of them now, but I still consider them my heroes because I was <em>obsessed</em> with anything and everything they wrote or said that I could find. When I realized I look up to Tom Scott, I understood why I was obsessed with all of them — they kept writing and working on new projects. They were never satisfied. I’ve been paralyzed internally trying to find satisfaction, or regain my attention, when the answer is quite simple: there is no closure; I just need to keep working on new projects (and writing).</p>

<p>Publishing <a href="https://venkatasg.net/blog/detext-2021-11-06.html">DeTeXt on the App Store</a> was exhilarating, because I made it. The last few years of my life have been the best, and the worst. But the numbness that I recognize internally, can be traced back to the root cause of failing to <em>make things</em>. Anything. This blog post, an app, research, kozhukattai…I should be able to say ‘I made this’ much more often.</p>]]></content><author><name>Venkat</name></author><summary type="html"><![CDATA[It’s been a while since I’ve written any blog posts. When I think about my heroes, a few names come to mind: Christopher Hitchens, Jessica Valenti, Chomsky, Ambedkar, and Tom Scott. I see the failings of some of them now, but I still consider them my heroes because I was obsessed with anything and everything they wrote or said that I could find. When I realized I look up to Tom Scott, I understood why I was obsessed with all of them — they kept writing and working on new projects. They were never satisfied. I’ve been paralyzed internally trying to find satisfaction, or re-gain my attention, when the answer is quite simple; There is no closure, I just need to keep working on new projects (and writing). Publishing DeTeXt on the App Store was exhilarating, because I made it. The last few years of my life have been the best, and the worst. But the numbness that I recognize internally, can be traced back to the root cause of failing to make things. Anything. This blog post, an app, research, kozhukattai…I should be able to say ‘I made this’ much more often.]]></summary></entry><entry><title type="html">Authentic Schmauthentic</title><link href="http://venkatasg.net/blog/authentic-2023-12-30.html" rel="alternate" type="text/html" title="Authentic Schmauthentic" /><published>2023-12-30T00:00:00+00:00</published><updated>2023-12-30T00:00:00+00:00</updated><id>http://venkatasg.net/blog/authentic</id><content type="html" xml:base="http://venkatasg.net/blog/authentic-2023-12-30.html"><![CDATA[<blockquote>
  <p>What the fuck does authentic mean anyway? … The term ‘‘authentic’’ … is essentially meaningless. ‘‘Authentic’’ when? ‘‘Authentic’’ to whom? But it sounds good and wise doesn’t it?</p>

  <p>Anthony Bourdain. ‘‘14. Alan Richman is a Douchebag’’. <em>Medium Raw</em></p>
</blockquote>

<p>Some things leave a clear semantic trace in my mind — I don’t remember the exact words<sup id="fnref:1" role="doc-noteref"><a href="#fn:1" class="footnote" rel="footnote">1</a></sup>, nor the circumstances, but they leave a dent. In my mind, Bourdain’s acerbic takedown of a journalist questioning the authenticity of Creole cuisine is up there with <a href="https://www.relay.fm/rd/3">Siracusa expounding on sports</a>, or the final shot in Ozu’s <a href="https://letterboxd.com/film/the-only-son/"><em>The Only Son</em></a>. They contain multitudes in so little, and not just because they’re akin to pithy aphorisms — they nudge me to research, read, learn, and ponder on their meaning endlessly. They enrich my life, my work, and relationships, even if it involves uncomfortable introspection.</p>

<div class="img-container">
    <picture>
        <a href="/files/blog/authentic/ozu.png">
            <img src="/files/blog/authentic/ozu.png" style="width:100%;" alt="A frame from the last few seconds of Ozu's The Only Son, showing the mother seated outside the silk mill, anguish on her face." />
        </a>
    </picture>
</div>

<p>Questioning the authenticity of food especially evokes a deep-seated rejection fermented by years of pondering on Bourdain’s words and my own cowering behind that empty word. This isn’t to say one can’t criticize food (or any cultural product), but if you see fit to judge and expound on it, be decent enough to admit what you’re doing and why — maybe you wanna feel superior, or occupy space in the conversation, or be seen. You can do that without empty gestures towards authenticity.</p>

<p>Funnily enough, what I yearn for is for people to be genuinely authentic (in the sense of being honest about what they’re feeling or wanting) with me.</p>

<!-- Links and footnotes -->
<div class="footnotes" role="doc-endnotes">
  <ol>
    <li id="fn:1" role="doc-endnote">
      <p>which is just as well, since I enjoyed re-reading this chapter, and the book, perhaps just as much as the first time, in the process of finding this excerpt. Bourdain could write and cook — I’m still not good enough at the former. <a href="#fnref:1" class="reversefootnote" role="doc-backlink">&#8617;</a></p>
    </li>
  </ol>
</div>]]></content><author><name>Venkat</name></author><summary type="html"><![CDATA[What the fuck does authentic mean anyway? … The term ‘‘authentic’’ … is essentially meaningless. ‘‘Authentic’’ when? ‘‘Authentic’’ to whom? But it sounds good and wise doesn’t it? Anthony Bourdain. ‘‘14. Alan Richman is a Douchebag’’. Medium Raw Some things leave a clear semantic trace in my mind — I don’t remember the exact words1, nor the circumstances, but they leave a dent. In my mind, Bourdain’s acerbic takedown of a journalist questioning the authenticity of Creole cuisine is up there with Siracusa expounding on sports, or the final shot in Ozu’s The Only Son. They contain multitudes in so little, and not just because they’re akin to pithy aphorisms — they nudge me to research, read, learn, and ponder on their meaning endlessly. They enrich my life, my work, and relationships, even if it involves uncomfortable introspection. Questioning the authenticity of food especially evoke a deep-seated rejection fermented by years of pondering on Bourdain’s words, and my own cowering behind that empty word. This isn’t to say one can’t criticize food (or any cultural product), but if you see fit to judge and expound on it, be decent enough to admit what you’re doing and why — maybe you wanna feel superior, or occupy space in the conversation, or be seen. You can do that without empty gestures towards authenticity. Funnily enough, what I yearn is for people to be genuinely authentic (in the sense of be honest about what you’re feeling or wanting) with me. which is just as well, since I enjoyed re-reading this chapter, and the book, perhaps just as much as the first time, in the process of finding this excerpt. Bourdain could write and cook — I’m still not good enough at the former. &#8617;]]></summary></entry><entry><title type="html">The Responsibility of Non Residential Intellectuals</title><link href="http://venkatasg.net/blog/responsibility-2023-12-09.html" rel="alternate" type="text/html" title="The Responsibility of Non Residential Intellectuals" /><published>2023-12-09T00:00:00+00:00</published><updated>2023-12-09T00:00:00+00:00</updated><id>http://venkatasg.net/blog/responsibility</id><content type="html" xml:base="http://venkatasg.net/blog/responsibility-2023-12-09.html"><![CDATA[<blockquote>
  <p>It is easy for an American intellectual to deliver homilies on the virtues of freedom and liberty, but if he is really concerned about, say, Chinese totalitarianism or the burdens imposed on the Chinese peasantry in forced industrialization, then he should face a task that is infinitely more important and challenging—the task of creating, in the United States, the intellectual and moral climate, as well as the social and economic conditions, that would permit this country to participate in modernization and development in a way commensurate with its material wealth and technical capacity.</p>

  <p>— <a href="https://chomsky.info/19670223/">The Responsibility of Intellectuals, Noam Chomsky</a></p>
</blockquote>

<p>I have lived in the US for over 6 years. I feel like I’ve come of age here — certainly I didn’t dwell on my responsibilities much when I lived in India. I <em>think</em> I want to live here, even though I don’t belong (and never will), but what of this responsibility I feel (with my freedoms) to do something about injustices in India?</p>

<p>Answering these questions<sup id="fnref:1" role="doc-noteref"><a href="#fn:1" class="footnote" rel="footnote">1</a></sup> is important to me, since it clarifies the root of disagreements with family, friends and colleagues, and how to <em>effectively</em> fight injustice in India. I don’t have a great answer yet — but the strong whiff of bullshit coming off me when I oppose injustices in the US, when it was India that <em>made me</em>, is a good enough motivator to find an answer.</p>

<p>So what are the responsibilities of the non-resident intellectual beyond seeking the truth back home, and the historical context of the truth? Supporting intellectuals and activists back home must be one of them, if not the foremost one.</p>

<!-- Links and footnotes -->

<div class="footnotes" role="doc-endnotes">
  <ol>
    <li id="fn:1" role="doc-endnote">
      <p>which resurfaced after a tiring conversation with an <a href="https://en.wikipedia.org/wiki/Michel_Foucault">amoral</a>, fatalistic friend <a href="#fnref:1" class="reversefootnote" role="doc-backlink">&#8617;</a></p>
    </li>
  </ol>
</div>]]></content><author><name>Venkat</name></author><summary type="html"><![CDATA[It is easy for an American intellectual to deliver homilies on the virtues of freedom and liberty, but if he is really concerned about, say, Chinese totalitarianism or the burdens imposed on the Chinese peasantry in forced industrialization, then he should face a task that is infinitely more important and challenging—the task of creating, in the United States, the intellectual and moral climate, as well as the social and economic conditions, that would permit this country to participate in modernization and development in a way commensurate with its material wealth and technical capacity. — The Responsibility of Intellectuals, Noam Chomsky I have lived in the US for over 6 years. I feel like I’ve come of age here — certainly I didn’t dwell on my responsibilities much when I lived in India. I think I want to live here, even though I don’t belong (and never will), but what of this responsibility I feel (with my freedoms) to do something about injustices in India? Answering these questions1 are important to me, since it clarifies the root of disagreements with family, friends and colleagues, and how to effectively fight injustice in India. I don’t have a great answer yet — but the strong whiff of bullshit from me when opposing injustices in the US when India made me is a good enough motivator to find an answer. So what are the responsibilities of the non-resident intellectual beyond seeking the truth back home, and the historical context of the truth? Supporting intellectuals and activists back home must be one of them, if not the foremost one. which resurfaced after a tiring conversation with an amoral, fatalistic friend &#8617;]]></summary></entry><entry><title type="html">I’m thinking of ending things with attention again</title><link href="http://venkatasg.net/blog/attention-2023-10-09.html" rel="alternate" type="text/html" title="I’m thinking of ending things with attention again" /><published>2023-10-09T00:00:00+00:00</published><updated>2023-10-09T00:00:00+00:00</updated><id>http://venkatasg.net/blog/attention</id><content type="html" xml:base="http://venkatasg.net/blog/attention-2023-10-09.html"><![CDATA[<p>Two years ago, <a href="https://venkatasg.net/blog/attention-2021-10-20.html">I thought</a> I would ‘discipline’ myself by being deliberate with the way I spend time online. I have failed, quite spectacularly at that. I can’t keep myself off hacker news, reddit, twitter, or any of the million blogs out there. More often than not, there’s nothing there when I visit them. But the random reinforcement of something that might scratch my itch always has me coming back.</p>

<p>What is to be done? The <a href="https://www.goodreads.com/book/show/42771901-how-to-do-nothing">fix-the-system crowd</a> wax philosophical about how loverly it would be in their world where we’re one with nature, and how we need to organize and fight corporations. My patience is thin with these pleas, especially with this issue. What’s more likely to happen — that I discipline how I allocate my attention, or that the internet economy re-aligns itself through self or external regulation around the amorphous concern of ‘attention’?</p>

<p>I’ll take my chances with disciplining myself.</p>

<!-- Links and footnotes -->]]></content><author><name>Venkat</name></author><summary type="html"><![CDATA[Two years ago, I thought I would ‘discipline’ myself by being deliberate with the way I spend time online. I have failed, quite spectacularly at that. I can’t keep myself off hacker news, reddit, twitter, or any of the million blogs out there. More often than not, there’s nothing there when I visit them. But the random reinforcement of something that might scratch my itch always has me coming back. What is to be done? The fix-the-system crowd wax philosophy about how loverly it would be in their world where we’re one with nature, and how we need to organize and fight corporations. My patience is thin with these pleas, especially with this issue. What’s more likely to happen — that I discipline how I allocate my attention, or that the internet economy re-aligns itself through self or external regulation around the amorphous concern of ‘attention’? I’ll take my chances with disciplining myself.]]></summary></entry><entry><title type="html">The centre is not where it seems</title><link href="http://venkatasg.net/blog/center-2023-09-05.html" rel="alternate" type="text/html" title="The centre is not where it seems" /><published>2023-09-05T00:00:00+00:00</published><updated>2023-09-05T00:00:00+00:00</updated><id>http://venkatasg.net/blog/center</id><content type="html" xml:base="http://venkatasg.net/blog/center-2023-09-05.html"><![CDATA[<p>Nick Heer talking about <a href="https://pxlnv.com/2019/09/10/">the centered Apple logo on iPhones 11</a> (and all iPhones after that as of the writing of this post):</p>

<blockquote>
  <p>I can’t find it right now, but I remember an old piece of advice — possibly in the HIG — that said that items mathematically centred vertically tend to look like they’re lower than they are. The suggestion was that a visually vertically centred item typically needed about twice as much space below the item compared to the space above it.</p>
</blockquote>

<p><a href="http://www.multimedialab.be/doc/tech/doc_osx_hi_guidelines.pdf">Page 184 of Apple’s Human Interface Guidelines</a> from the 2003 edition:</p>

<blockquote>
  <p>[the] distance from the bottom of the window to the top of the Dock (if it’s at the bottom of the screen) should be approximately twice the distance as that from the bottom of the menu bar to the top of the window.</p>
</blockquote>

<p><a href="https://practicaltypography.com/page-margins.html">Matthew Butterick</a> on centering page contents using margins:</p>

<blockquote>
  <p>To make the text block appear centered vertically, try making the bottom margin about 0.25″ larger than the top margin. Otherwise, the text block can look like it’s sagging.</p>
</blockquote>
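
<p>The same heuristic translates directly to code. As a rough SwiftUI sketch (values invented for illustration, not taken from any of the sources above), give the item about twice as much space below as above and it reads as vertically centered:</p>

<div class="language-swift highlighter-rouge"><div class="highlight"><pre class="highlight"><code>import SwiftUI

// Optical centering: roughly twice the space below as above,
// so the item doesn't look like it's sagging.
struct OpticallyCentered: View {
    var body: some View {
        VStack(spacing: 0) {
            Spacer().frame(height: 40)  // space above
            Text("Looks centered")
            Spacer().frame(height: 80)  // ~2x the space below
        }
    }
}
</code></pre></div></div>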

<p>Finally, <a href="https://www.youtube.com/watch?v=l4BAY9ERJow">David Lynch as John Ford to Sammy Fabelman (Spielberg)</a>:</p>

<blockquote>
  <p>Where is the horizon?…Where is the goddamn horizon!?…When the horizon’s at the bottom it’s interesting. When the horizon’s at the top, it’s interesting. <strong>When the horizon’s in the middle, it’s boring as shit.</strong></p>
</blockquote>

<!-- Links -->]]></content><author><name>Venkat</name></author><summary type="html"><![CDATA[Nick Heer talking about the centered Apple logo on iPhones 11 (and all iPhones after that as of the writing of this post): I can’t find it right now, but I remember an old piece of advice — possibly in the HIG — that said that items mathematically centred vertically tend to look like they’re lower than they are. The suggestion was that a visually vertically centred item typically needed about twice as much space below the item compared to the space above it. Page 184 of Apple’s HIG guidelines from the 2003 edition: [the] distance from the bottom of the window to the top of the Dock (if it’s at the bottom of the screen) should be approximately twice the distance as that from the bottom of the menu bar to the top of the window. Matthew Butterick on centering page contents using margins: To make the text block ap­pear cen­tered ver­ti­cally, try mak­ing the bot­tom mar­gin about a 0.25″ larger than the top mar­gin. Oth­er­wise, the text block can look like it’s sag­ging. Finally, David Lynch as John Ford to Sammy Fabelman (Spielberg): Where is the horizon?…Where is the goddamn horizon!?…When the horizon’s at the bottom it’s interesting. When the horizon’s at the top, it’s interesting. When the horizon’s in the middle, it’s boring as shit.]]></summary></entry><entry><title type="html">Eternal Darkness of the Mutating Thought</title><link href="http://venkatasg.net/blog/memory-2023-08-25.html" rel="alternate" type="text/html" title="Eternal Darkness of the Mutating Thought" /><published>2023-08-25T00:00:00+00:00</published><updated>2023-08-25T00:00:00+00:00</updated><id>http://venkatasg.net/blog/memory</id><content type="html" xml:base="http://venkatasg.net/blog/memory-2023-08-25.html"><![CDATA[<p>Human memory is flawed and gets worse with time. One phenomenon where I face my own memory’s fallibility, with increasing regularity, is the contrast between an actual event that occurred in my past, and the memory in my mind many years after. I wanted to record one as I process it, even though it is a memory of a poem I read, so not strictly an event.</p>

<p>Only one <a href="https://en.wikipedia.org/wiki/Kural"><em>Thirukural</em></a> of the hundreds we had to memorize in school has stuck with me. I don’t remember the words of course, only the rough meaning:</p>

<blockquote>
  <p>A son’s duty (to his parents) is to ensure that others speak highly of/praise him to his parents.</p>
</blockquote>

<p>As I write this, it’s probably been 12 – 15 years since I read this <em>kural</em>. As it turns out, this wasn’t one <em>kural</em> leaving a trace in my memory, but two. Both are from the <a href="https://en.wikipedia.org/wiki/Aram_(Kural_book)">Book of Virtue</a>, in the chapter dealing with children. The first, <a href="https://www.thirukkural.net/en/kural/kural-0069.html"><em>kural</em> 69</a>, is about one’s duty to their mother<sup id="fnref:1" role="doc-noteref"><a href="#fn:1" class="footnote" rel="footnote">1</a></sup>:</p>

<blockquote>
  <p>Eendra Pozhudhin Peridhuvakkum Thanmakanaich<br />Chaandron Enakketta Thaai</p>
</blockquote>

<blockquote>
  <p>A woman rejoices at the birth of a son, But even more when he is praised.</p>
</blockquote>

<p><a href="https://www.thirukkural.net/en/kural/kural-0070.html"><em>Kural</em> 70</a> is about duty to the father:</p>

<blockquote>
  <p>Makandhandhaikku Aatrum Udhavi Ivandhandhai<br />Ennotraan Kol Enum Sol</p>
</blockquote>

<blockquote>
  <p>The son’s duty to his father is to make the world ask, ‘By what austerities did he merit such a son!’</p>
</blockquote>

<p>I’m glad to realize now, at the wise age of 29, that even Valluvar had to resort to padding. Usually I lament my memory’s fallibility, but in this case, I’m glad my mind saw the redundancy and reduced them to one thought.</p>

<p>Unfortunately, the amount of mindspace this thought occupies is more than two or four verses, and will only grow with time. Every day is a slow realization of how true the verses are when they shouldn’t be, and a reminder of how I’ve not done enough, even though <a href="https://katemanne.substack.com/p/against-aspiration">I’m enough</a>.</p>

<!-- Links and footnotes -->

<div class="footnotes" role="doc-endnotes">
  <ol>
    <li id="fn:1" role="doc-endnote">
      <p>in transliterated Tamil. I cannot be bothered to get fonts for Tamil on my blog just yet <a href="#fnref:1" class="reversefootnote" role="doc-backlink">&#8617;</a></p>
    </li>
  </ol>
</div>]]></content><author><name>Venkat</name></author><summary type="html"><![CDATA[Human memory is flawed and gets worse with time. One phenomenon where I face my own memory’s fallibility, with increasing regularity, is the contrast between an actual event that occurred in my past, and the memory in my mind many years after. I wanted to record one as I process it, even though it is a memory of a poem I read, so not strictly an event. Only one Thirukural of the hundreds we had to memorize in school has stuck with me. I don’t remember the words of course, only the rough meaning: A son’s duty (to his parents) is to ensure that others speak highly of/praise him to his parents. As I write this, I probably read this kural 12 – 15 years ago. As it turns out, this wasn’t one kural leaving a trace in my memory, but two. Both are from the Book of Virtue, in the chapter dealing with children. The first, kural 69 is about one’s duty to their mother1: Eendra Pozhudhin Peridhuvakkum ThanmakanaichChaandron Enakketta Thaai A woman rejoices at the birth of a son, But even more when he is praised. Kural 70 is about duty to the father: Makandhandhaikku Aatrum Udhavi IvandhandhaiEnnotraanKol Enum Sol The son’s duty to his father is to make world ask, ‘By what austerities did he merit such a son! I’m glad to realize now, in my wise age of 29, that even Valluvar had to resort to padding. Usually I lament my memory’s fallibility, but in this case, I’m glad my mind saw the redundancy and reduced them to one thought. Unfortunately, the amount of mindspace this thought occupies is more than two or four verses, and will only grow with time. Every day is a slow realization how true the verses are when they shouldn’t be, and a reminder how I’ve not done enough, even though I’m enough. in transliterated Tamil. I cannot be bothered to get fonts for Tamil on my blog just yet &#8617;]]></summary></entry><entry><title type="html">Meaning what?</title><link href="http://venkatasg.net/blog/aigrice-2023-08-18.html" rel="alternate" type="text/html" title="Meaning what?" /><published>2023-08-18T00:00:00+00:00</published><updated>2023-08-18T00:00:00+00:00</updated><id>http://venkatasg.net/blog/aigrice</id><content type="html" xml:base="http://venkatasg.net/blog/aigrice-2023-08-18.html"><![CDATA[<p>Paul Grice’s <a href="https://www.jstor.org/stable/pdf/2182440.pdf">1957 paper</a> on meaning is a favorite of mine. His style is an acquired taste, but once you get used to it, it is an absolute delight to read. For instance, his example contrasting the photograph with the drawing, while instructive, took me a long time to fully comprehend:</p>

<blockquote>
  <p>(1) I show Mr. X a photograph of Mr. Y displaying undue familiarity to Mrs. X.<br />(2) I draw a picture of Mr. Y behaving in this manner and show it to Mr. X.<br />I find that I want to deny that in (1) the photograph (or my showing it to Mr. X) meant<sub>NN</sub> anything at all; while I want to assert that in (2) the picture (or my drawing and showing it) <strong>meant</strong><sub>NN</sub> something</p>
</blockquote>

<p>The example, and his explanation that follows, leads him to come up with a succinct definition of non-natural meaning:</p>

<blockquote>
  <p>A uttered <em>x</em> with the intention of inducing a belief in B <strong>by means</strong> of the recognition of this intention.</p>
</blockquote>

<p>Breaking down the different intentions in the definition is helpful. There are three distinct and inter-dependent intentions:</p>

<ol>
  <li>A intends to induce a belief in B.</li>
  <li>A intends for B to recognize their intention to induce said belief.</li>
  <li>A intends for B to gain this belief <em>by means</em> of recognition of the first intention.</li>
</ol>

<p>I’ve been thinking a lot about this definition of meaning in the context of ‘AI generated’ art. <a href="https://astralcodexten.substack.com/p/open-thread-284/comment/18354821">The usual suspects</a> are confused about why ‘AI generated’ art hasn’t taken off. The obvious answer is that it’s because we know it’s not made by a person — if we know something was solely software generated, we lose our ability to connect. But is this the same thing as saying we know that the generated art is meaningless? A work of art is not an utterance, so Grice’s definition will need to be adapted, but what needs to change?</p>

<p>Originally, I was going to go conclude this post nicely by going through the intentions when a person makes art. But I realize now that this would be unsatisfactory without answering the questions I just raised, and answering those would be a much longer post. I’ll come back to this one day, but revisiting the paper, and writing down the definition was fun.</p>]]></content><author><name>Venkat</name></author><summary type="html"><![CDATA[Paul Grice’s 1957 paper on meaning is a favorite of mine. His style is an acquired taste, but once you get used to it, it is an absolute delight to read. For instance, his example contrasting the photograph with the drawing, while instructive, took me a long time to fully comprehend: (1) I show Mr. X a photograph of Mr. Y displaying undue familiarity to Mrs. X.(2) I draw a picture of Mr. Y behaving in this manner and show it to Mr. X.I find that I want to deny that in ( I ) the photograph (or my showing it to Mr. X) meant,, anything at all; while I want to assert that in (2) the picture (or my drawing and showing it) meant something The example, and his explanation that follows, leads him to come up with a succinct definition of non-natural meaning: A uttered x with the intention of inducing a belief in B by means of the recognition of this intention. Breaking down the different intentions in the definition is helpful. There are 3 distinct and inter-dependent intentions: A intends to induce a belief in B. A intends for B to recognize their intention to induce said belief. A intends for B to gain this belief by means of recognition of the first intention. I’ve been thinking a lot of this definition of meaning in the context of ‘AI generated’ art. The usual suspects are confused why ‘AI generated’ art hasn’t taken off. The obvious answer is that it’s because we know it’s not made by a person — if we know something was solely software generated, we lose our ability to connect. But is this the same thing as saying we know that the generated art is meaningless? A work of art is not an utterance, so Grice’s definition will need to be adapted, but what needs to change? Originally, I was going to go conclude this post nicely by going through the intentions when a person makes art. But I realize now that this would be unsatisfactory without answering the questions I just raised, and answering those would be a much longer post. I’ll come back to this one day, but revisiting the paper, and writing down the definition was fun.]]></summary></entry><entry><title type="html">How to talk about people so that you alienate them</title><link href="http://venkatasg.net/blog/research-2023-07-15.html" rel="alternate" type="text/html" title="How to talk about people so that you alienate them" /><published>2023-07-15T00:00:00+00:00</published><updated>2023-07-15T00:00:00+00:00</updated><id>http://venkatasg.net/blog/research</id><content type="html" xml:base="http://venkatasg.net/blog/research-2023-07-15.html"><![CDATA[<p>I have one year left in my PhD. It’s fascinating looking back and reading my previous two (<a href="https://venkatasg.net/blog/glib-2023-01-30.html">1</a>, <a href="https://venkatasg.net/blog/bias-2020-12-19.html">2</a>) blog posts on the topic of my research. In addition to loosely charting my thinking on the topic over 2 years, I’m glad I wrote about my feelings on it as well. My initial fear has evolved to a healthy motivational fear — I have to figure out the next step if I want to get things done. 
Wanting to be done with grad school is a great motivator at the moment. There is a small part of me that’s afraid that I might have nothing to show at the end of it all, but I’ve plowed through that fear so many times at this point that it’s a trace of what it used to be.</p>

<p>This is an evolution even from my most recent post, where I was still quoting the Litany against fear for my research. Of all things. At this moment.</p>

<p>So what’s new since then? Two papers, a talk at EACL that was received well by a few choice people, and most importantly, an idea for a long-term research niche. <strong>Generalization</strong>, in Language, and across languages and communication mediums. A rich vein I can harvest for years to come. If I stay in academia.</p>

<!-- Links -->]]></content><author><name>Venkat</name></author><summary type="html"><![CDATA[I have one year left in my PhD. It’s fascinating looking back and reading my previous two (1, 2) blog posts on the topic of my research. In addition to loosely charting my thinking on the topic over 2 years, I’m glad I wrote about my feelings on it as well. My initial fear has evolved to a healthy motivational fear — I have to figure out the next step if I want to get things done. Wanting to be done with grad school is a great motivator at the moment. There is a small part of me that’s afraid that I might have nothing to show at the end of it all, but I’ve plowed through that fear so many times at this point that it’s a trace of what it used to be. This is an evolution even from my most recent post, where I was still quoting the Litany against fear for my research. Of all things. At this moment. So what’s new since then? Two papers, a talk at EACL that was received well by a few choice people, and most importantly, an idea for a long-term research niche. Generalization, in Language, and across languages and communication mediums. A rich vein I can harvest for years to come. If I stay in academia.]]></summary></entry><entry><title type="html">Manmade</title><link href="http://venkatasg.net/blog/manmade-2023-04-11.html" rel="alternate" type="text/html" title="Manmade" /><published>2023-04-11T00:00:00+00:00</published><updated>2023-04-11T00:00:00+00:00</updated><id>http://venkatasg.net/blog/manmade</id><content type="html" xml:base="http://venkatasg.net/blog/manmade-2023-04-11.html"><![CDATA[<p>This blog is called Manmade, but unless you subscribe through RSS, you wouldn’t know. It’s partially from an oft-repeated Hitchens quip: ‘Religion is manmade’. However, I first recall hearing the compound in a longer context — Religion is manmade; in fact it is <em>male-made</em>. I’m paraphrasing for now since I can’t find the debate or talk where Hitchens said this. I’ve since heard versions of this said by many others, including <a href="https://www.youtube.com/watch?v=kYfNvmF0Bqw">Steve Jobs</a>:</p>

<blockquote>
  <p>Everything around you that you call life was made up by people that were no smarter than you.</p>
</blockquote>

<p>I’m thinking about this today, but in a different context than what Hitchens or Jobs ever intended (probably). All the bigotries and unjust hierarchies that surround us are manmade. The ‘traditions’ that we impose on others, even those we love are manmade. Culture is manmade — just because something is part of ‘our culture’ doesn’t make it moral. I wish it was easy for everyone to accept this truism. Until then, I’m resigned to suffering on account of the cultural obstinacy of loved ones, although I console myself with the thought (or promise to myself) that I would never impose the traditions and customs that comfort me on my children. The cycle ends with me.</p>

<p>Unrelated to all this, I feel…weird quoting Hitchens so often when I write here. One day I’ll write about my heroes, Hitchens (one of them), and how my thoughts and feelings with regard to him have evolved over the years.</p>

<!-- Links -->]]></content><author><name>Venkat</name></author><summary type="html"><![CDATA[This blog is called Manmade, but unless you subscribe through RSS, you wouldn’t know. Its partially from an oft-repeated Hitchens quip: ‘Religion is manmade’. However, I first recall hearing the compound in a longer context — Religion is manmade; in fact it is male-made. I’m paraphrasing for now since I can’t find the debate or talk where Hitchens said this. I’ve since heard versions of this said by many others, including Steve Jobs: Everything around you that you call life was made up by people that were no smarter than you. I’m thinking about this today, but in a different context than what Hitchens or Jobs ever intended (probably). All the bigotries and unjust hierarchies that surround us are manmade. The ‘traditions’ that we impose on others, even those we love are manmade. Culture is manmade — just because something is part of ‘our culture’ doesn’t make it moral. I wish it was easy for everyone to accept this truism. Until then, I’m resigned to suffering on account of the cultural obstinacy of loved ones, although I console myself with the thought (or promise to myself) that I would never impose the traditions and customs that comfort me on my children. The cycle ends with me. Unrelated to all this, I feel…weird quoting Hitchens so often when I write here. One day I’ll write about my heroes, Hitchens (one of them), and how my thoughts and feelings with regards to him have evolved over the years.]]></summary></entry><entry><title type="html">I’d like to have a narrow, precise argument please</title><link href="http://venkatasg.net/blog/argument-2023-03-31.html" rel="alternate" type="text/html" title="I’d like to have a narrow, precise argument please" /><published>2023-03-31T00:00:00+00:00</published><updated>2023-03-31T00:00:00+00:00</updated><id>http://venkatasg.net/blog/argument</id><content type="html" xml:base="http://venkatasg.net/blog/argument-2023-03-31.html"><![CDATA[<p>I was reading Resident Contrarian’s recent <a href="https://www.residentcontrarian.com/p/bubble-explainer-article-housefire">blog post</a> when a throwaway line at the beginning caught my attention, prompting a longer train of thought:</p>

<blockquote>
  <p>The most troublesome arguments to try to make are those that are both very specific and limited (and are by nature of that more or less non-controversial) …</p>
</blockquote>

<p>I’ll need to re-read his post multiple times to understand his actual argument — I find that often true of him, and the rationalist community (even though he hates <a href="https://www.residentcontrarian.com/p/im-trying-to-figure-out-why-i-dont">them</a>). Until then, I want to address just these lines, and the sentiment behind them, which frustrates me in many aspects of my life.</p>

<p>Yes, specific and limited arguments are less controversial, but they’re <em>not less interesting</em>. I firmly believe being specific and narrow in your line of questioning or argument (or even interests) is a good thing. It usually doesn’t carry the baggage that RC is afraid of — or if it does, just state that you don’t want to generalize outright and move on.</p>

<p>I see this with research all the time; work on a niche subject or topic needs to be generalizable beyond the domain of analysis, or the authors will have a hard time with reviewers. You can’t just study, for instance, how Eagles fans talk about Cowboys fans online. What about other teams? What if it’s not online? What if it’s not sports? I get that a research question is always grounded in past and contemporary work, and we need to be cognizant of that and report it, but I think we’re overdoing it in Computational Linguistics and NLP right now. Anything and everything you do has to be generalizable, or fit into the existing work perfectly.</p>

<p>Not that this is a unique problem with research in my area, or research in general. Arguments online and in-person get derailed because we forget the specific circumstance or incident — it’s so much easier to generalize without actively wondering if it makes sense to generalize. I do this too, and am trying hard every day to counter this (I suspect) deep-rooted tendency in my mind to find patterns, over-generalize, or at worst, stereotype.</p>

<p>I’m not happy with this post as it stands; I feel like I’m jumping between the specificity of arguments, questions, beliefs, and thoughts. That’s fine — I’ll always be thinking about this, and writing my thoughts down clarifies them for me. Maybe in a few months, I’ll revisit this topic. But today, I do want to mention and link to two comments that resonated with me, and possibly catalyzed my thinking on this topic. First, from Christopher Hitchens <a href="https://www.youtube.com/watch?v=78Jl2iPPUtI">in this video</a> (starting from 18 seconds in), on how to ask a question:</p>

<blockquote>
  <p>Here’s a piece of advice about asking a question. Try and narrow it down a bit… if you give me too much to choose from, you’re likely to get everything or nothing, and I certainly can’t give you everything…</p>
</blockquote>

<p>Also, here’s Nikki Giovanni, <a href="https://www.youtube.com/watch?v=AFGkNEt30Fo">starting at 21:10</a>, on being parochial in her activism (the entire interview is amazing):</p>

<blockquote>
  <p>… because I tend to be parochial, for one thing, and I tend to care about Afro-Americans … It’s very parochial because I don’t care about my third world brothers and sisters … If I deal with my block, and you do with your block, we’ll have two good blocks.</p>
</blockquote>

<p>Being parochial in one’s passions might seem like a different thing from what I was talking about earlier, but they’re not separate in my mind. I don’t think I’m overgeneralizing here to lump these two together. I think what they’re both saying succinctly describes the value of being narrow, be it in questions, arguments or interests.</p>

<!-- Links -->]]></content><author><name>Venkat</name></author></entry><entry><title type="html">Language is for writing is for thought</title><link href="http://venkatasg.net/blog/writing-2023-03-11.html" rel="alternate" type="text/html" title="Language is for writing is for thought" /><published>2023-03-11T00:00:00+00:00</published><updated>2023-03-11T00:00:00+00:00</updated><id>http://venkatasg.net/blog/writing</id><content type="html" xml:base="http://venkatasg.net/blog/writing-2023-03-11.html"><![CDATA[<p><a href="https://katemanne.substack.com/p/your-partner-is-not-a-project">Kate Manne</a>, over on her lovely substack:</p>

<blockquote>
  <p>I write in this space in order to put ideas out there quickly, often in an unpolished form. I like the idea of sometimes making my mistakes in a relatively open forum. Unlike an article, or a book, which are years and years in the making, this substack newsletter capitalizes on my strong desire to write, sometimes quickly: to seize a few hours or even minutes here and there, and try to articulate ideas which have sometimes nagged at me for a decade or more without ever reaching fruition. By and large, I think by writing.</p>
</blockquote>

<p>All my agonizing over what this blog is for, and what I should write about, and why, and someone else puts it down in better words than I ever could. Like it does for her, writing clarifies and helps me think. I don’t, however, think by writing right now — though I would <em>like to</em>. I <a href="https://katemanne.substack.com/p/against-aspiration">aspire</a> to it, for its own sake.</p>

<!-- Links -->]]></content><author><name>Venkat</name></author></entry><entry><title type="html">The Bridges of Venkat County</title><link href="http://venkatasg.net/blog/bridges-2023-03-04.html" rel="alternate" type="text/html" title="The Bridges of Venkat County" /><published>2023-03-04T00:00:00+00:00</published><updated>2023-03-04T00:00:00+00:00</updated><id>http://venkatasg.net/blog/bridges</id><content type="html" xml:base="http://venkatasg.net/blog/bridges-2023-03-04.html"><![CDATA[<p>I’ve burnt a few bridges in my short life. Some bridges have water under them, but most don’t. I was obsessed with idioms when I started writing this post — perhaps because the reading for the research seminar that week was on <a href="https://en.wikipedia.org/wiki/Construction_grammar">Construction grammars</a>, where idioms (and constructions of all kinds) are as essential as syntactic categories, lexical items and morphemes. I might be misunderstanding construction grammar considering I have only read a few papers about it, but I’ll take this opportunity regardless to overuse various constructions to death, violating <a href="https://www.orwellfoundation.com/the-orwell-foundation/orwell/essays-and-other-works/politics-and-the-english-language/">one of Orwell’s maxims</a>.</p>

<p>It was only recently that I realized that sometimes, frayed and broken relationships cannot be fixed. I could apologize, make up for my mistakes and shortcomings, undo all the harm I’ve caused, and yet. What I learned, I think, is that a sincere apology merely transfers the responsibility onto the other person to forgive, forget, and accept you back. The truth is that there is no moral or natural law that mandates people forgive, forget, and accept after a sincere apology, nor would I want there to be.</p>

<p>All of this may seem obvious in hindsight, but this is my blog after all. I can write about trivial things as if they were life-altering revelations.</p>

<!-- Links -->]]></content><author><name>Venkat</name></author></entry><entry><title type="html">The finite, bounded interface trumps the infinite, unrestricted interface</title><link href="http://venkatasg.net/blog/lazy-2023-01-31.html" rel="alternate" type="text/html" title="The finite, bounded interface trumps the infinite, unrestricted interface" /><published>2023-01-31T00:00:00+00:00</published><updated>2023-01-31T00:00:00+00:00</updated><id>http://venkatasg.net/blog/lazy</id><content type="html" xml:base="http://venkatasg.net/blog/lazy-2023-01-31.html"><![CDATA[<p>Me, <a href="https://venkatasg.net/posts/dont-make-me-talk">back in October 2020</a>:</p>

<blockquote>
  <p>It struck me that today’s AI assistants (Alexa, Google Assistant, Siri) are all based around having conversations. If these systems ever approached anything close to human intelligence and common-sense, perhaps having a conversation is the best way to interact with AI…maybe the best way to interact with artificial intelligence is the same way we interact with other people — using conversations.</p>
</blockquote>

<p>Austin Henley, <a href="https://austinhenley.com/blog/naturallanguageui.html">in a lovely blog post</a> from a few days ago:</p>

<blockquote>
  <p>A great user interface guides me and offers nudges.</p>

  <p>Couldn’t a natural language interface help with that?</p>

  <p>Certainly.</p>

  <p>But not as the only option. Probably not even the main interface.</p>
</blockquote>

<p>My first thought after reading this was <em>Why didn’t I think of that?!</em> Everything Austin says about software and user interface design I (sort of) knew 2 years ago. People know what they want — but they can rarely express it well, especially using words. Relying solely on natural language to gauge user intent is lazy, and will lead to poorly designed software.</p>

<p>A brief digression to put down my perspective on software and UI design: I believe that self-imposed constraints and strong opinions (weakly held) are good and necessary for building well-designed software. This path will lead to your product not appealing to everyone, but that’s okay! My favorite apps are opinionated and have a relatively small or medium-sized customer base, but that’s all they ever wanted. It always frustrates me when a new service/product clearly aims to be ‘the next big thing’ from the start. Why do you need millions of users/customers? If you’re one person building something new, <a href="https://kk.org/thetechnium/1000-true-fans/">you only need thousands</a>.</p>

<p>Circling back: my original post was about the best way to interact with virtual assistants like Siri. Natural language is the <em>only</em> interface for this class of software. They’ve been around for a decade, and while useful (especially for accessibility), they haven’t revolutionized how we do things every day, nor have they opened up new opportunities, I think. How do we make people successful at using computers with a lazy interface?</p>

<p>I hope I’ll play a role in answering that question one day.</p>

<!-- Links -->]]></content><author><name>Venkat</name></author></entry><entry><title type="html">Why so glib?</title><link href="http://venkatasg.net/blog/glib-2023-01-30.html" rel="alternate" type="text/html" title="Why so glib?" /><published>2023-01-30T00:00:00+00:00</published><updated>2023-01-30T00:00:00+00:00</updated><id>http://venkatasg.net/blog/glib</id><content type="html" xml:base="http://venkatasg.net/blog/glib-2023-01-30.html"><![CDATA[<p>Two years ago, I wrote <a href="https://venkatasg.net/blog/bias-2020-12-19.html">a post</a> right as I started my study of interpersonal biases in language. I don’t remember what my state of mind was when I wrote it, but I do know what it is like right now. If anyone embarking on a Ph.D. is reading this, I’d love for them to know how tortured the journey was to get to where I am. The only idea I had back then was to <em>computationally replicate and evaluate the LIB</em>. Now, after many missteps and detours, and a lot of nudging from David &amp; Jessy, I have (I think) a topic that I can call mine; a research program that no-one else has worked on (yet) that I can initiate and contribute to. <strong>Generalized (Linguistic) Intergroup Bias</strong> is what I’ve termed it.</p>

<p>I feel relieved and optimistic at this juncture in my Ph.D. And yet, the fear is still there, receded into the background. I can envision scenarios where it engulfs me again, but not many. In any case, I will face it when it comes, and <a href="https://en.wikipedia.org/wiki/Bene_Gesserit#Litany_against_fear">only I will remain after it passes</a>.</p>

<p>Will my research program lead anywhere, change anyone’s mind, or even be one that anyone apart from me finds interesting? Probably not, but I’ve come to realize that all that matters is that I do the work, do it well, and that I grow during the process. I’ve also come to learn that, like <a href="https://hypercritical.fireside.fm/15">programming languages</a>, the research programs that gain traction in a community, especially a relatively young one like Computational Linguistics, have little to do with the program’s promised benefits — external factors like the progenitor’s community and its ease of use play a big role. I can only aim to do my best with what I control — ensuring that I pursue my research questions honestly and rigorously.</p>

<h3 id="addendum">Addendum</h3>

<p>I was a little too optimistic in giving my own name to my focus of study, but it was good in the end to get pushback on <em>Generalized Intergroup Bias</em> or <em>GIB</em>. I study intergroup bias; how is in-group language different from out-group language? That’s the one-sentence summary of my dissertation, and I’m quite happy with it. My own term and acronym would have just made it too confusing.</p>

<!-- Links -->]]></content><author><name>Venkat</name></author></entry><entry><title type="html">Disclosure Triangle of Sadness</title><link href="http://venkatasg.net/blog/disclosure-2023-01-13.html" rel="alternate" type="text/html" title="Disclosure Triangle of Sadness" /><published>2023-01-13T00:00:00+00:00</published><updated>2023-01-13T00:00:00+00:00</updated><id>http://venkatasg.net/blog/disclosure</id><content type="html" xml:base="http://venkatasg.net/blog/disclosure-2023-01-13.html"><![CDATA[<p>One of the advantages of <em>not</em> being a full-time iOS app developer is that I can spend an unreasonable amount of time on minuscule details of app design. Paying attention to the details is important and a sign that you care — but it’s rarely noticed or rewarded by others, while shipping something that works is. In any case, sweating the details is a luxury I can afford now, which brings me to a UI design curiosity of Apple’s that sent me down a rabbit hole. Consider the empty tab page on Safari:</p>

<div class="img-container">
    <picture>
        <a href="/files/blog/disclosure/mac.png">
            <img src="/files/blog/disclosure/mac.png" style="width:100%;" alt="Safari New Tab Page on Mac with highlight around disclosure control pointing down" />
        </a>
    </picture>
</div>

<p>I’ve circled the disclosure control next to the ‘Shared With You’ section that lets you expand and show more items. Now consider the same New Tab page on iOS Safari:</p>

<div class="img-container">
    <picture>
    <a href="/files/blog/disclosure/ios.jpeg">
        <img class="previewimage" src="/files/blog/disclosure/ios.png" alt="Safari New Tab Page on iPhone with highlight around disclosure control pointing down" />
    </a>
    </picture>
</div>

<p>On the Mac, the control for expanding ‘Shared With You’ is a downwards-pointing chevron; it becomes upwards-pointing when you expand it. On iOS, it is downwards-pointing <em>after</em> you expand it, while it points to the side when the section is collapsed.</p>

<p>The first thought that came to mind when I noticed the different direction of the disclosure links was <a href="https://daringfireball.net/2020/11/disclosing_a_few_thoughts_on_facebooks_ui_design">Gruber’s post</a> on what disclosure controls ought to do:</p>

<blockquote>
  <p>In the iOS/Mac style, a right-pointing chevron (or triangle, depending on the OS) indicates the collapsed state, and a down-pointing chevron indicates the expanded state.</p>
</blockquote>

<p>Based on that article, I thought I had found another instance of Apple’s own apps not following the HIG, but that isn’t the case. From my reading of <a href="https://developer.apple.com/design/human-interface-guidelines/components/layout-and-organization/disclosure-controls">Apple’s HIG</a>, this control is a disclosure button on the Mac, but a disclosure <em>link</em> on iOS. Now I have two questions:</p>

<ul>
  <li>Why do disclosure links show state and imply action while disclosure buttons show action and imply state?</li>
  <li>Why is it a button on the Mac and a link on iOS?</li>
</ul>

<p>I feel like there ought to be a clear answer to the first one, but the examples in the HIG don’t elucidate the differences to me. Regarding the second question, I think it’s a mistake by whoever designed it. Disclosure links are all over macOS and they do exactly what this disclosure button in Safari does: expand a list of items to show more. Why should this control be different?</p>

<!-- Links -->]]></content><author><name>Venkat</name></author></entry><entry><title type="html">Bye Hitch</title><link href="http://venkatasg.net/blog/hitch-2022-12-17.html" rel="alternate" type="text/html" title="Bye Hitch" /><published>2022-12-17T00:00:00+00:00</published><updated>2022-12-17T00:00:00+00:00</updated><id>http://venkatasg.net/blog/hitch</id><content type="html" xml:base="http://venkatasg.net/blog/hitch-2022-12-17.html"><![CDATA[<p><a href="https://www.azquotes.com/quote/630881">Christopher Hitchens</a>:</p>

<blockquote>
  <p>In the ordinary moral universe, the good will do the best they can, the worst will do the worst they can, but if you want to make good people do wicked things, you’ll need religion.</p>
</blockquote>

<p>I used to believe this to be true, as I did many things that Hitch wrote. The statement is obviously false, as well as stubbornly simplistic. I’m not too concerned with the semantics of the quote, or the soundness of its argument, but I do think about it a lot, especially at this moment in my life.</p>

<p>I have no answers. I only have compassion for those who fear the intolerance of society, and loathing for the intolerant.</p>

<!-- Links -->]]></content><author><name>Venkat</name></author></entry><entry><title type="html">Just Causation</title><link href="http://venkatasg.net/blog/cause-2022-12-06.html" rel="alternate" type="text/html" title="Just Causation" /><published>2022-12-06T00:00:00+00:00</published><updated>2022-12-06T00:00:00+00:00</updated><id>http://venkatasg.net/blog/cause</id><content type="html" xml:base="http://venkatasg.net/blog/cause-2022-12-06.html"><![CDATA[<blockquote>
  <p><strong>Me</strong>: Does causation imply correlation?<br />
<strong>ChatGPT</strong>: Yes, causation implies correlation. If one event causes another, then there must be a relationship or correlation between the two events. Causation is the relationship between two events in which one event causes the other to occur. This means that if event A causes event B, there must be a correlation between the two events, because the presence of A is necessary for B to occur.</p>
</blockquote>

<p><em>Correlation doesn’t imply causation</em> is a dictum that’s been repeated to death in statistics classes everywhere. But does that mean causation necessarily implies correlation? Initially, I too would have had a similar answer to <a href="https://chat.openai.com/chat">ChatGPT</a> above, but as I’ve discovered, the answer depends on what you mean by ‘correlation’. To be precise, and it’s always good to be precise with math and statistics, causation does not necessarily imply correlation.</p>

<p>Correlation generally refers to <em>linear correlation</em>, as in Pearson’s correlation, which only measures linear relationships between variables. However, it’s perfectly possible for one variable to depend on another in a purely non-linear way, such that the corresponding linear correlation <a href="https://stats.stackexchange.com/a/26318">would be zero</a>. While we do have non-linear correlation measures, <a href="https://en.wikipedia.org/wiki/Mutual_information">mutual information</a> is the measure that we want in our new dictum: Causation implies high mutual information.</p>
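
<p>Here’s a quick numerical sketch of that point (my own toy example, not from the linked answer), assuming <code class="language-plaintext highlighter-rouge">numpy</code>, <code class="language-plaintext highlighter-rouge">scipy</code> and <code class="language-plaintext highlighter-rouge">scikit-learn</code> are installed: a variable that is entirely caused by another can still show near-zero linear correlation, while a mutual information estimate picks up the dependency just fine:</p>

<div class="language-python highlighter-rouge"><div class="highlight"><pre class="highlight"><code># Toy example: y is fully *caused* by x, yet Pearson's r is ~0.
import numpy as np
from scipy.stats import pearsonr
from sklearn.feature_selection import mutual_info_regression

rng = np.random.default_rng(42)
x = rng.normal(size=10_000)   # the cause
y = x ** 2                    # the effect: a deterministic function of x

r, _ = pearsonr(x, y)
# kNN-based estimate of mutual information between x and y (in nats)
mi = mutual_info_regression(x.reshape(-1, 1), y, random_state=42)[0]

print(f"Pearson r:          {r:+.3f}")  # ~0: no linear correlation
print(f"Mutual information: {mi:.2f}")  # large: strong dependency
</code></pre></div></div>

<p>The symmetry of <code class="language-plaintext highlighter-rouge">x</code> around zero is what kills the linear correlation here; the dependency itself is as strong as a dependency can be.</p>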

<p>I’m comforted that I’m not the only one who made the error of not being clear and precise with my words (and thus my understanding) in statistics. <a href="https://statmodeling.stat.columbia.edu/2021/05/23/thinking-fast-slow-and-not-at-all-system-3-jumps-the-shark/">Nobel prize winners do it too</a>.</p>

<h3 id="an-aside-about-chatgpt">An aside about ChatGPT</h3>

<p>The first time I tried the causation prompt with ChatGPT, it didn’t understand the question, and proceeded to give a rote explanation of why correlation doesn’t imply causation. The fact that these models are easily tripped up by this sort of diversionary tactic (as Venelin explored in the <a href="https://aclanthology.org/2022.dadc-1.5/">adversarial QA paper</a>) dissuades me the most from the idea that there is <strong>latent</strong> language understanding and meaning that can arise from the simple task of language modeling. However, when I tried it again before writing this post, ChatGPT got it right! I managed to come up with another one that tricked it along the same lines though:</p>

<blockquote>
  <p><strong>Me</strong>: What came second, the chicken or the egg<br />
<strong>ChatGPT</strong>: The question of which came first, the chicken or the egg, is a classic conundrum. It is often used as a metaphor for a problem that has no clear solution, or for a situation in which it is difficult to determine the cause and effect. In this case, there is no definitive answer, as the question itself is based on a false premise. Chickens hatch from eggs, so a chicken must have come from an egg. However, the egg that the chicken came from must have been laid by a chicken, so the chicken must have come first. This cycle continues indefinitely, making it impossible to determine which came first.</p>
</blockquote>

<!-- Links -->]]></content><author><name>Venkat</name></author></entry><entry><title type="html">Build to Learn</title><link href="http://venkatasg.net/blog/learn-2022-11-29.html" rel="alternate" type="text/html" title="Build to Learn" /><published>2022-11-29T00:00:00+00:00</published><updated>2022-11-29T00:00:00+00:00</updated><id>http://venkatasg.net/blog/learn</id><content type="html" xml:base="http://venkatasg.net/blog/learn-2022-11-29.html"><![CDATA[<p>Andrej Karpathy on <a href="https://www.youtube.com/watch?v=I2ZK3ngNvvI">the Lex Fridman podcast</a>, when asked how beginners should approach learning machine learning:</p>

<blockquote>
  <p>I am a believer at a high level of the 10,000 hours concept…You literally have to put in 10,000 hours of work…</p>
</blockquote>

<p>Now, while Malcolm Gladwell does peddle a lot of crap (see <a href="https://statmodeling.stat.columbia.edu/2021/10/06/gladwell/">Gelman</a>), the 10,000 hours idea is reasonable. Keep working on something for a few hours every day for a few years, and you’ll probably be really, really good at it at the end. I think it’s good advice, to which I’d add a small aphorism. The best way to learn a technical concept in software, machine learning, and related fields is to <em>build</em> something with it. I learnt SwiftUI and Combine by building DeTeXt. I learnt JavaScript by building websites and web apps. Most of my understanding of transformers and modern deep neural nets comes from trying to use them in my research projects. So yes, spend a lot of time consistently on a topic, and try building something while you’re learning it.</p>

<p>I want to be a good writer, and the only way forward towards that goal is to write more. But what am I building? This blog? This doesn’t feel like a cohesive unit that I’m building, not that it has to be one. Perhaps I’ll keep chipping away at the 10,000 hours by writing regularly here, until I find something I want to build.</p>

<!-- Links -->]]></content><author><name>Venkat</name></author></entry><entry><title type="html">Game-y game</title><link href="http://venkatasg.net/blog/gamey-2022-10-09.html" rel="alternate" type="text/html" title="Game-y game" /><published>2022-10-09T00:00:00+00:00</published><updated>2022-10-09T00:00:00+00:00</updated><id>http://venkatasg.net/blog/gamey</id><content type="html" xml:base="http://venkatasg.net/blog/gamey-2022-10-09.html"><![CDATA[<p>I’ve gotten back into gaming recently, and realized something while I played the brilliant <a href="https://en.wikipedia.org/wiki/Metroidvania">metroidvania</a> game <a href="https://apps.apple.com/us/app/dandara-trials-of-fear/id1576164868">Dandara: Trials of Fear+</a> – video games fall along a spectrum of game-y-ness, just like any other category. This was a concept the great Greg Carlson introduced to me in a Pragmatics class: there are prototypical examples of a category that help us cognitively classify things in the world. So while penguins and magpies are birds, a magpie is a very bird-y bird, while a penguin is less so. Similarly, Dandara is more of a game-y game than, say, <a href="https://en.wikipedia.org/wiki/Inside_(video_game)">Inside</a> or <a href="https://en.wikipedia.org/wiki/Firewatch">Firewatch</a>, as <a href="https://www.youtube.com/watch?v=wTXRMBoc4Qs&amp;t=180s">dunkey bemoans at the beginning of this video</a>. This isn’t a novel thought, but it’s another way of thinking past the uninteresting <em>are video games art?</em> question.</p>

<!-- Links -->]]></content><author><name>Venkat</name></author></entry><entry><title type="html">DeTeXt, a year later</title><link href="http://venkatasg.net/blog/detext-2021-11-06.html" rel="alternate" type="text/html" title="DeTeXt, a year later" /><published>2021-11-06T00:00:00+00:00</published><updated>2021-11-06T00:00:00+00:00</updated><id>http://venkatasg.net/blog/detext</id><content type="html" xml:base="http://venkatasg.net/blog/detext-2021-11-06.html"><![CDATA[<p>A year and 2 months ago, I released <a href="https://apps.apple.com/us/app/id1531906207" title="Download DeTeXt on the App Store">DeTeXt</a> – my first iOS app that I had built using SwiftUI, Combine, CoreML and PencilKit. Thinking back, I believe I went from no code to an app in the App Store in 6 weeks? I can hardly believe it now, but I am proud of building DeTeXt – I’m glad it helped some people find the LaTeX symbol they wanted and saved perhaps a few seconds of their day. Building and releasing DeTeXt emphasized one aspect of myself – I love building tools and working on projects that help other people be successful at their work, even in very small ways.</p>

<p>Initially, I had trained the <a href="https://pytorch.org/hub/pytorch_vision_mobilenet_v2/" title="MobileNetV2 - PyTorch">MobileNetV2</a> neural network for DeTeXt’s symbol recognition engine on a 4-GPU cluster. Today, I did the same on my M1 Pro MacBook Pro running on battery, thanks to <a href="https://developer.apple.com/metal/tensorflow-plugin/" title="Tensorflow Metal Plugin">tensorflow-metal</a> and the 16-core GPU. I wish I had documented the training better on the cluster, but I do have the training time for the M1 Pro: 6 minutes per epoch, for training a <a href="https://www.tensorflow.org/tutorials/images/classification" title="Tensorflow - Image Classification">simple image classification neural net</a> on over 200,000 images. After 1 epoch, the network reached an accuracy of 65%. Six minutes to train a complex neural network with over 7 million parameters, on battery power. Wow.</p>
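
<p>For the curious, here’s a minimal sketch of what that kind of training run looks like with tf.keras. This is my reconstruction for illustration, not the actual DeTeXt training code; the class count and dataset path are placeholders. With the tensorflow-metal plugin installed, TensorFlow dispatches ops to the Apple GPU automatically, with no code changes:</p>

<div class="language-python highlighter-rouge"><div class="highlight"><pre class="highlight"><code># Transfer-learning sketch: fine-tune MobileNetV2 as an image classifier.
# NUM_CLASSES and "symbols/train" are hypothetical placeholders.
import tensorflow as tf

NUM_CLASSES = 1000  # placeholder: one class per LaTeX symbol
IMG_SIZE = (224, 224)

base = tf.keras.applications.MobileNetV2(
    input_shape=IMG_SIZE + (3,), include_top=False, weights="imagenet")
base.trainable = False  # freeze the backbone; train only the new head

model = tf.keras.Sequential([
    tf.keras.layers.Rescaling(1.0 / 127.5, offset=-1),  # MobileNetV2 expects [-1, 1]
    base,
    tf.keras.layers.GlobalAveragePooling2D(),
    tf.keras.layers.Dense(NUM_CLASSES, activation="softmax"),
])
model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])

# Labelled drawings on disk, one subdirectory per class
train_ds = tf.keras.utils.image_dataset_from_directory(
    "symbols/train", image_size=IMG_SIZE, batch_size=64)

model.fit(train_ds, epochs=1)
</code></pre></div></div>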

<!-- Links -->]]></content><author><name>Venkat</name></author></entry><entry><title type="html">Thinking about attention again</title><link href="http://venkatasg.net/blog/attention-2021-10-20.html" rel="alternate" type="text/html" title="Thinking about attention again" /><published>2021-10-20T00:00:00+00:00</published><updated>2021-10-20T00:00:00+00:00</updated><id>http://venkatasg.net/blog/attention</id><content type="html" xml:base="http://venkatasg.net/blog/attention-2021-10-20.html"><![CDATA[<p>I’ve read, and re-read, <a href="https://www.youtube.com/CGPGrey">CGP Grey’s</a> blog <a href="https://www.cgpgrey.com/blog/cyclops" title="CGP Grey — Thinking About Attention">post</a> (a.k.a. <a href="https://www.youtube.com/watch?v=iLqbsgwLHX0&amp;t=4262s">Project Cyclops</a>) about wrestling back his focus and attention from online systems that feed on it. I’ve tried it a few times before, but this is the first time I’ve written about it publicly. So, effective immediately:</p>

<ul>
  <li>No Hacker News</li>
  <li>No Reddit</li>
  <li>No Twitter</li>
  <li>No MacRumors, ArsTechnica, Techmeme, AnandTech…</li>
</ul>

<p>These activities are fine (but with some conditions):</p>

<ul>
  <li>Podcasts (only while running, or at the gym)</li>
  <li>YouTube (during meals)</li>
  <li>NetNewsWire (once a day)</li>
</ul>

<p>Why am I doing this? For the same reason CGP Grey spelt out in his blog — I want to spend more time doing things I care about: research &amp; study, reading, writing, cooking, building easy-to-use and delightful software tools and apps, and watching films.</p>

<h3 id="update">Update</h3>

<p>This is much harder than I thought it would be. I find myself replacing the time I would sink into the outlawed activities with others that seem ‘alright’ — Wikipedia, Letterboxd… There isn’t any magical solution, other than to be mindful of what I’m doing.</p>

<!-- Links -->]]></content><author><name>Venkat</name></author></entry><entry><title type="html">Two out of Five</title><link href="http://venkatasg.net/blog/reebok-2021-05-17.html" rel="alternate" type="text/html" title="Two out of Five" /><published>2021-05-17T00:00:00+00:00</published><updated>2021-05-17T00:00:00+00:00</updated><id>http://venkatasg.net/blog/reebok</id><content type="html" xml:base="http://venkatasg.net/blog/reebok-2021-05-17.html"><![CDATA[<p>Phil Price over at <a href="https://statmodeling.stat.columbia.edu">Andrew Gelman’s blog</a> had an <a href="https://statmodeling.stat.columbia.edu/2021/05/11/if-a-value-is-less-than-10-you-can-bet-its-not-0-1-usually/">interesting post</a> about a certain unnamed principle that caught my attention:</p>

<blockquote>
  <p>Many years ago I saw an ad for a running shoe (maybe it was Reebok?) that said something like <strong>“At the New York Marathon, three of the five fastest runners were wearing our shoes.”</strong> I’m sure I’m not the first or last person to have realized that there’s more information there than it seems at first. For one thing, you can be sure that one of those three runners finished fifth: otherwise the ad would have said “three of the four fastest.” Also, it seems almost certain that the two fastest runners were not wearing the shoes, and indeed it probably wasn’t 1-3 or 2-3 either: “The two fastest” and “two of the three fastest” both seem better than “three of the top five.” The principle here is that if you’re trying to make the result sound as impressive as possible, an unintended consequence is that you’re revealing the upper limit.</p>
</blockquote>

<p>My first thought was naturally that this sentence is an instance of a <a href="https://en.wikipedia.org/wiki/Scalar_implicature">scalar implicature</a>. The unintended consequence arises because of a violation of the <a href="https://plato.stanford.edu/entries/grice/#ConvImpl">maxim of quantity</a>. Reebok doesn’t want to say “The third, fourth and fifth fastest runners of the marathon were wearing our shoes” (in the worst case), so they choose to say this instead. We, as <a href="https://en.wikipedia.org/wiki/Paul_Grice#Conversational_implicature">Gricean</a> listeners, are licensed to derive the hidden inference because of the form of the utterance.</p>

<p>But does Reebok want us to make the inference that one of those three runners finished fifth? Of course not. The construction “three of the five fastest runners” is interesting for this reason – it doesn’t hide the inference, but it weakens it. Consider the alternatives:</p>

<ol>
  <li>Some of the five fastest runners were wearing our shoes.</li>
  <li>Three of the fastest runners were wearing our shoes.</li>
  <li>Three of the five fastest runners were wearing our shoes.</li>
</ol>

<p>(3) seems ideal – it’s technically the truth, and it sounds more impressive than (1) and (2). But I can’t shake the feeling that I’m missing something obvious in my thinking. For one, the partitive construction “Three of the five” should lead to a probabilistically stronger inference, as <a href="https://semprag.org/article/view/sp.8.11">Judith Degen</a> has shown in her work. Also, (3) sounds more impressive than (2), right? But why is the inference in (3) non-obvious on a first reading of the marketing slogan? Is there an effect due to the usage of the numeral determiners <em>three</em> and <em>five</em>? Assuming the truth of the statement, is <em>Forty of the fifty fastest runners were wearing our shoes</em> as impressive as (3)?</p>

<p>I’ll need to see what work has been done on these constructions, but sentences of this form seem to illustrate Degen’s <em>non-homogeneity</em> assertion: not all scalar implicatures have the same strength.</p>

<h3 id="update">Update</h3>

<p>Andrew Gelman makes an <a href="https://statmodeling.stat.columbia.edu/2021/05/11/if-a-value-is-less-than-10-you-can-bet-its-not-0-1-usually/#comment-1827414">interesting point</a> in reply to my question of why Reebok would use this phrasing when it could lead to a weaker implicature: people know that something is up but we’re still taken in, similar to how we’re more likely to buy something if it’s ₹99 rather than ₹100.</p>

<!-- Links -->]]></content><author><name>Venkat</name></author></entry><entry><title type="html">Anti-authoritarian</title><link href="http://venkatasg.net/blog/middle-class-2021-04-21.html" rel="alternate" type="text/html" title="Anti-authoritarian" /><published>2021-04-21T00:00:00+00:00</published><updated>2021-04-21T00:00:00+00:00</updated><id>http://venkatasg.net/blog/middle-class</id><content type="html" xml:base="http://venkatasg.net/blog/middle-class-2021-04-21.html"><![CDATA[<p><a href="https://caravanmagazine.in/sports/the-moral-timidity-of-sachin-tendulkar" title="Caravan Magazine">Vaibhav Vats in Caravan magazine</a>, on the pathetic orchestrated tweets by Indian celebrities following <a href="https://twitter.com/rihanna/status/1356625889602199552">Rihanna’s tweet</a> bringing attention to the farmers’ protests:</p>

<blockquote>
  <p>Tendulkar became the epitome of values prized in the conventional, hierarchal and self-congratulatory milieu of the middle class, showing no eagerness to challenge the many prejudices of society and state. His notion of ethics remained limited purely to the realm of his own personal conduct … this personal decency has always been accompanied by a deeply ingrained timidity towards authority, a primal fear of upsetting any establishment, whether cricketing or otherwise.</p>
</blockquote>

<p>This comment on the middle class resonated with me: respect authority, don’t step out of line, don’t cause trouble, etc. Why risk being anti-authoritarian when you can maintain your place in the social hierarchy by being obedient to authority?</p>

<!-- Links -->]]></content><author><name>Venkat</name></author></entry><entry><title type="html">Money Laundering for Bias</title><link href="http://venkatasg.net/blog/bias-2020-12-19.html" rel="alternate" type="text/html" title="Money Laundering for Bias" /><published>2020-12-19T00:00:00+00:00</published><updated>2020-12-19T00:00:00+00:00</updated><id>http://venkatasg.net/blog/bias</id><content type="html" xml:base="http://venkatasg.net/blog/bias-2020-12-19.html"><![CDATA[<p>It’s terrifying to choose an area of study for your PhD research – what if it turns out to be a dead end in a year? That thought is inescapable as I start (in earnest) studying the <a href="https://en.wikipedia.org/wiki/Linguistic_intergroup_bias">Linguistic Intergroup Bias</a>.</p>

<p>At this critical juncture in my research (and life), I’m glad I found <a href="https://idlewords.com/talks/sase_panel.htm">this talk by Maciej Ceglowski</a>. It might seem overly simplistic, but I appreciate its sentiment, and find it useful to keep in mind when studying bias from a computational perspective:</p>

<blockquote>
  <p>Instead of relying on algorithms, which we can be accused of manipulating for our benefit, we have turned to machine learning, an ingenious way of disclaiming responsibility for anything. <strong>Machine learning is like money laundering for bias</strong>. It’s a clean, mathematical apparatus that gives the status quo the aura of logical inevitability. The numbers don’t lie.</p>
</blockquote>

<!-- Links -->]]></content><author><name>Venkat</name></author><summary type="html"><![CDATA[It’s terrifying to choose an area of study for your PhD research – what if it turns out to be a dead end in a year? That thought is inescapable as I start (in earnest) studying the Linguistic Intergroup Bias. At this critical juncture in my research (and life), I’m glad I found this talk by Maciej Ceglowski. It might seem overly simplistic, but I appreciate its sentiment, and find it useful to keep in mind when studying bias from a computational perspective: Instead of relying on algorithms, which we can be accused of manipulating for our benefit, we have turned to machine learning, an ingenious way of disclaiming responsibility for anything. Machine learning is like money laundering for bias. It’s a clean, mathematical apparatus that gives the status quo the aura of logical inevitability. The numbers don’t lie.]]></summary></entry><entry><title type="html">False Friends</title><link href="http://venkatasg.net/blog/paradesi-2020-11-02.html" rel="alternate" type="text/html" title="False Friends" /><published>2020-11-02T00:00:00+00:00</published><updated>2020-11-02T00:00:00+00:00</updated><id>http://venkatasg.net/blog/paradesi</id><content type="html" xml:base="http://venkatasg.net/blog/paradesi-2020-11-02.html"><![CDATA[<p><em>pardés</em> in <a href="https://en.wiktionary.org/wiki/פרדס" title="Wiktionary - pardes">Hebrew</a>:</p>

<blockquote>
  <ol>
    <li>orchard, grove, fruit garden</li>
  </ol>
</blockquote>

<p><em>paradesi</em> in <a href="https://agarathi.com/word/பரதேசி" title="Agarathi - paradesi">Tamil</a>:</p>

<blockquote>
  <ol>
    <li>Foreigner, stranger …</li>
    <li>Sojourner, pilgrim, traveller</li>
  </ol>
</blockquote>

<p>What the latter definition leaves out is that <em>paradesi</em> is almost always used as a <a href="https://www.quora.com/What-is-the-meaning-of-paradesi-Tamil" title="What is the meaning of paradesi (Tamil) - Quora">derogatory</a> word. Are the two words <a href="https://en.wikipedia.org/wiki/Cognate" title="Wikipedia - Cognates">cognates</a>? The Hebrew word’s roots are <a href="https://en.wikipedia.org/wiki/Proto-Iranian_language" title="Wikipedia - Proto-Iranian">Proto-Iranian</a> as the etymology describes, while the Tamil word is probably derived from Sanskrit. Proto-Iranian and Sanskrit are both theorized to have developed from <a href="https://en.wikipedia.org/wiki/Proto-Indo-European_language" title="Wikipedia - Proto-Indo-European">Proto-Indo-European</a>. They are probably <a href="https://en.wikipedia.org/wiki/False_friend" title="Wikipedia - False Friend">false friends</a> — true cognates with different meanings due to <a href="https://en.wikipedia.org/wiki/Semantic_change" title="Wikipedia -  Semantic Change">semantic change</a>.</p>

<p>My interest in these two words began when I learned about the <a href="https://en.wikipedia.org/wiki/Paradesi_Jews" title="Wikipedia - Paradesi Jews">Paradesi Jews</a> of India — is it an accident that a word with negative connotations ties back to people who faced persecution and antisemitism? Maybe, but the sad truth today is that few people in Chennai are aware of its <a href="https://www.dtnext.in/News/City/2016/05/09232326/Chennais-link-to-its-Jewish-past-cemetery-in-Mylapore.vpf">Jewish past</a>. History is the story of people — many stories are <a href="https://www.andrewwhitehead.net/blog/another-term-in-chennai-toyah-farewell">tragic</a> but <a href="https://www.newindianexpress.com/cities/chennai/2012/oct/31/18th-century-jewish-cemetery-lies-in-shambles-craves-for-attention-420448.html">deserve to be remembered</a>.</p>

<!-- Links -->]]></content><author><name>Venkat</name></author><summary type="html"><![CDATA[pardés in Hebrew: orchard, grove, fruit garden paradesi in Tamil: Foreigner, stranger … 2. Sojourner, pilgrim, traveller; What the latter definition leaves out is that paradesi is almost always used as a derogatory word. Are the two words cognates? The Hebrew word’s roots are Proto-Iranian as the etymology describes, while the Tamil word is probably derived from Sanskrit. Proto-Iranian and Sanskrit are both theorized to have developed from Proto-Indo-European. They are probably false friends — true cognates with different meanings due to semantic change. My interest in these two words began when I learned about the Paradesi Jews of India — is it an accident that a word with negative connotations ties back to people who faced persecution and antisemitism? Maybe, but the sad truth today is that few people in Chennai are aware of its Jewish past. History is the story of people - many stories are tragic but deserve to be remembered.]]></summary></entry><entry><title type="html">Don’t Make Me Talk</title><link href="http://venkatasg.net/blog/dont-make-me-talk-2020-10-31.html" rel="alternate" type="text/html" title="Don’t Make Me Talk" /><published>2020-10-31T00:00:00+00:00</published><updated>2020-10-31T00:00:00+00:00</updated><id>http://venkatasg.net/blog/dont-make-me-talk</id><content type="html" xml:base="http://venkatasg.net/blog/dont-make-me-talk-2020-10-31.html"><![CDATA[<p><a href="http://hypercritical.co/about/" title="John Siracusa - About">John Siracusa</a> in <a href="https://5by5.tv/hypercritical/19" title="Hypercritical Ep.19 - Don't Make Me Read">episode 19</a> of his pre-eminent podcast <a href="https://5by5.tv/hypercritical" title="Hypercritical Podcast">Hypercritical</a>:</p>

<blockquote>
  <p>What were the earliest mass-market PC interfaces like?…they were like <strong>conversations</strong>…you’d tell it what to do, it gives information back to you…That was the basic paradigm until the Macintosh…What it gave you was not a conversation, but a thing…you could poke the thing and see how it reacts…it worked like a <strong>physical object</strong>…</p>
</blockquote>

<p>I really liked the comparison of early command-prompt user interfaces to <strong>conversations</strong>. It struck me that today’s AI assistants (Alexa, Google Assistant, Siri) are all based around having conversations. If these systems ever approach anything close to human intelligence and common sense, perhaps having a conversation is the best way to interact with AI. But is there a better interface for interacting with AI? What’s the next leap from conversational AI? Perhaps <a href="https://en.wikipedia.org/wiki/Augmented_reality" title="Augmented Reality - Wikipedia">Augmented Reality</a> is the answer — artificial intelligence dispersed in our lived reality, giving us glanceable information and the illusion of a <strong>physical object</strong> we can interact with. I wonder if this is why Apple is <a href="https://www.apple.com/augmented-reality/" title="Apple - Augmented Reality">so bullish on AR</a> as well.</p>

<p>Or maybe the best way to interact with artificial intelligence is the same way we interact with other people — using conversations.</p>

<!-- Links -->]]></content><author><name>Venkat</name></author><summary type="html"><![CDATA[John Siracusa in episode 19 of his pre-eminent podcast Hypercritical: What were the earliest mass-market PC interfaces like?…they were like conversations…you’d tell it what to do, it gives information back to you…That was the basic paradigm until the Macintosh…What it gave you was not a conversation, but a thing…you could poke the thing and see how it reacts…it worked like a physical object… I really liked the comparison of early command-prompt user interfaces to conversations. It struck me that today’s AI assistants (Alexa, Google Assistant, Siri) are all based around having conversations. If these systems ever approach anything close to human intelligence and common sense, perhaps having a conversation is the best way to interact with AI. But is there a better interface for interacting with AI? What’s the next leap from conversational AI? Perhaps Augmented Reality is the answer — artificial intelligence dispersed in our lived reality, giving us glanceable information and the illusion of a physical object we can interact with. I wonder if this is why Apple is so bullish on AR as well. Or maybe the best way to interact with artificial intelligence is the same way we interact with other people — using conversations.]]></summary></entry><entry><title type="html">Stop Using MTurk</title><link href="http://venkatasg.net/blog/mturk-2020-09-15.html" rel="alternate" type="text/html" title="Stop Using MTurk" /><published>2020-09-15T00:00:00+00:00</published><updated>2020-09-15T00:00:00+00:00</updated><id>http://venkatasg.net/blog/mturk</id><content type="html" xml:base="http://venkatasg.net/blog/mturk-2020-09-15.html"><![CDATA[<p>Why now? My thought process was simple: My research output thus far has been
based on <a href="https://www.theatlantic.com/business/archive/2018/01/amazon-mechanical-turk/551192/" title="The Internet Is Enabling a New Kind of Poorly Paid Hell - The Atlantic">underpaid labour</a>. Do I want that to continue?</p>

<p>For me, the answer is no, which is why I’m going to stop using MTurk. Perhaps there
are alternatives that are fair to workers, that lead to good wages and job
security. Linguistic data is important, and more so when it comes from a
diverse population. But I don’t think paying <a href="https://www.nytimes.com/interactive/2019/11/15/nyregion/amazon-mechanical-turk.html" title="I Found Work on an Amazon Website. I Made 97 Cents an Hour. - NY Times">minuscule amounts</a> of
money to obtain it is something to be proud of. We can do better.</p>

<h3 id="update">Update</h3>

<p>I’ve been thinking more about this topic after a research seminar discussion – would ensuring that workers get paid at least $10 per hour fix the problem? It certainly ensures that they get paid a decent wage <em>by you</em>. A few requesters making the decision to pay more (and advertising their decision) isn’t going to make the <a href="https://www.cis.upenn.edu/~ccb/publications/data-driven-analysis-of-workers-earnings-on-amazon-mechanical-turk.pdf" title="A Data-Driven Analysis of Workers’ Earnings on Amazon Mechanical Turk - Paper">median wage go up</a>, nor does it do much to fix the <a href="https://www.microsoft.com/en-us/research/wp-content/uploads/2016/06/JCSCWTurking20in20a20Global20Market_AlmostFinalSoOK4CW-1.pdf" title="Turking In A Global Labour Market - Paper">numerous other issues</a> with how the platform treats workers. It does help in a small way, and it’s the least evil thing you can do on the platform.</p>
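<p>If you do keep running tasks, the wage-floor arithmetic is worth making explicit before posting a batch. Here’s a minimal sketch – the task names and per-assignment time estimates below are hypothetical, and in practice you’d calibrate the times with a pilot batch:</p>

<div class="language-python highlighter-rouge"><div class="highlight"><pre class="highlight"><code># Back-of-the-envelope wage floor: the pay per assignment needed to hit
# a target hourly wage. Task names and time estimates are hypothetical.
TARGET_HOURLY_WAGE = 10.00  # dollars per hour

tasks = {
    'acceptability-judgement': 3.0,  # estimated minutes per assignment
    'image-caption-check': 1.5,
}

for name, minutes in tasks.items():
    pay = TARGET_HOURLY_WAGE * minutes / 60
    print(f'{name}: pay at least ${pay:.2f} per assignment')
</code></pre></div></div>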

<h3 id="update-2">Update 2</h3>

<p><a href="https://fairwork.stanford.edu" title="Stanford Fair Work">Fair Work</a> is a cool resource from Stanford’s HCI group that runs a server-side script to ensure your crowd workers are paid a minimum wage of $15. I found it while reading this excellent <a href="https://www.stanforddaily.com/2020/06/21/complexity-theory-amazons-mechanical-turk-is-a-disaster-for-crowdworkers/" title="The Stanford Daily: Amazon’s Mechanical Turk is a disaster for crowdworkers">opinion piece</a> by Glenn Davis &amp; Klint Kanopka at The Stanford Daily. I agree with their conclusion: it’s up to Amazon to change the incentive structure – requesters paying workers more and being transparent helps, but from the workers’ perspective, it will always be a race to the bottom.</p>

<!-- Links -->]]></content><author><name>Venkat</name></author><summary type="html"><![CDATA[Why now? My thought process was simple: My research output thus far has been based on underpaid labour. Do I want that to continue? For me, the answer is no, which is why I’m going to stop using MTurk. Perhaps there are alternatives that are fair to workers, that lead to good wages and job security. Linguistic data is important, and more so when it comes from a diverse population. But I don’t think paying minuscule amounts of money to obtain it is something to be proud of. We can do better. Update I’ve been thinking more about this topic after a research seminar discussion – would ensuring that workers get paid at least $10 per hour fix the problem? It certainly ensures that they get paid a decent wage by you. A few requesters making the decision to pay more (and advertising their decision) isn’t going to make the median wage go up, nor does it do much to fix the numerous other issues with how the platform treats workers. It does help in a small way, and it’s the least evil thing you can do on the platform. Update 2 Fair Work is a cool resource from Stanford’s HCI group that runs a server-side script to ensure your crowd workers are paid a minimum wage of $15. I found it while reading this excellent opinion piece by Glenn Davis &amp; Klint Kanopka at The Stanford Daily. I agree with their conclusion: it’s up to Amazon to change the incentive structure – requesters paying workers more and being transparent helps, but from the workers’ perspective, it will always be a race to the bottom.]]></summary></entry><entry><title type="html">Ongoing Genocide</title><link href="http://venkatasg.net/blog/genocide-2020-08-15.html" rel="alternate" type="text/html" title="Ongoing Genocide" /><published>2020-08-15T00:00:00+00:00</published><updated>2020-08-15T00:00:00+00:00</updated><id>http://venkatasg.net/blog/genocide</id><content type="html" xml:base="http://venkatasg.net/blog/genocide-2020-08-15.html"><![CDATA[<p>Howard Zinn in <em>A People’s History of the United States</em> (emphasis mine):</p>

<blockquote>
  <p>One can lie outright about the past. Or one can omit facts which might lead to
unacceptable conclusions. Morison does neither. He refuses to lie about Columbus.
He does not omit the story of mass murder; indeed <strong>he describes it with the
harshest word one can use: genocide</strong>.</p>
</blockquote>

<p>This passage, and especially the last line, made an unusual impression on me
when I first read it, and still does. Perhaps it’s the fact that one word needs
to do so much work. How can one word possibly convey the magnitude of death,
destruction, and suffering? On the other hand, <em>genocide</em>, along with words like
<em>racist</em>, do seem to work. Those in power (and the privileged) seem more upset
over being labelled racist, or being accused of genocide, than the crimes
themselves.</p>

<p>I was reminded of the passage when reading a <a href="https://www.aspi.org.au/report/uyghurs-sale">report</a> describing
the forced labour of Uyghurs in China, which used the term <strong>cultural
genocide</strong> (or ethnocide) to describe the Chinese government’s practices. Words
matter, and the crimes against <a href="https://theconversation.com/call-the-crime-in-kashmir-by-its-name-ongoing-genocide-120412">Kashmiris</a> and
<a href="https://www.newindianexpress.com/states/tamil-nadu/2019/jul/24/why-crime-against-dalits-not-terrorism-2008423.html">Dalits</a> need to be called what they are, regularly and
loudly.</p>

<p>Can the decades of military occupation of Kashmir, and the torture and killing
of Kashmiris be captured by these words? Or the centuries of discrimination and
persecution directed at the Dalit community? Calling these crimes what they
truly are — <strong>ongoing genocide</strong> — may not be adequate to describe the
extent of the horrors that the people have experienced, nor the generational
trauma, but it is a good start.</p>]]></content><author><name>Venkat</name></author><summary type="html"><![CDATA[Howard Zinn in A People’s History of the United States (emphasis mine): One can lie outright about the past. Or one can omit facts which might lead to unacceptable conclusions. Morison does neither. He refuses to lie about Columbus. He does not omit the story of mass murder; indeed he describes it with the harshest word one can use: genocide. This passage, and especially the last line, made an unusual impression on me when I first read it, and still does. Perhaps it’s the fact that one word needs to do so much work. How can one word possibly convey the magnitude of death, destruction, and suffering? On the other hand, genocide, along with words like racist, do seem to work. Those in power (and the privileged) seem more upset over being labelled racist, or being accused of genocide, than the crimes themselves. I was reminded of the passage when reading a report describing the forced labour of Uyghurs in China, which used the term cultural genocide (or ethnocide) to describe the Chinese government’s practices. Words matter, and the crimes against Kashmiris and Dalits need to be called what they are, regularly and loudly. Can the decades of military occupation of Kashmir, and the torture and killing of Kashmiris be captured by these words? Or the centuries of discrimination and persecution directed at the Dalit community? Calling these crimes what they truly are — ongoing genocide — may not be adequate to describe the extent of the horrors that the people have experienced, nor the generational trauma, but it is a good start.]]></summary></entry><entry><title type="html">Nuance Fucks</title><link href="http://venkatasg.net/blog/nuance-fucks-2020-08-03.html" rel="alternate" type="text/html" title="Nuance Fucks" /><published>2020-08-03T00:00:00+00:00</published><updated>2020-08-03T00:00:00+00:00</updated><id>http://venkatasg.net/blog/nuance-fucks</id><content type="html" xml:base="http://venkatasg.net/blog/nuance-fucks-2020-08-03.html"><![CDATA[<p>I recently re-read Kieran Healy’s <a href="https://kieranhealy.org/publications/fuck-nuance/" title="K.J.Healy's Fuck Nuance paper">2015 paper</a> deriding the abundance
of ‘nuance’ in U.S. Sociological research. I love this paper because it forced me
to think about how I may fall into some of the ‘nuance traps’ that he identifies,
especially the nuance of the <strong>fine-grain</strong>:</p>

<blockquote>
  <p>First is the ever more detailed, merely empirical description of the world.
This is the nuance of the fine-grain.</p>
</blockquote>

<p>Is there a similar phenomenon of <em>nuance rising</em> in Computational Linguistics?
Anecdotally, I do see the words <em>nuance</em> and <em>fine-grain</em> more often in new
papers and talks. There is a way to verify this — the <a href="https://www.aclweb.org/anthology/" title="ACL Anthology">ACL Anthology</a>
provides a bibliography of articles with their titles and abstracts dating back
to the 1960s. Searching the bibliography using some simple regular expressions, we
see that the proportion of articles that mention the words <em>nuance</em> or <em>fine-grain</em>
in their titles or abstracts has indeed risen in the last 10 years (there were
few mentions before 2000, so I’ve excluded them).</p>
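<p>For the curious, here’s a minimal sketch of this kind of search. The field regexes are naive assumptions about the consolidated <code class="language-plaintext highlighter-rouge">anthology.bib</code> file (double-quoted values); a careful replication should use a proper BibTeX parser:</p>

<div class="language-python highlighter-rouge"><div class="highlight"><pre class="highlight"><code>import re
from collections import Counter

# Minimal sketch: split the consolidated anthology.bib into entries and
# tally, per year, how many mention 'nuance' or 'fine-grain(ed)' in
# their title or abstract. Field extraction is deliberately naive.
PATTERN = re.compile(r'nuance|fine-?grain', re.IGNORECASE)

totals, nuanced = Counter(), Counter()

with open('anthology.bib', encoding='utf-8') as f:
    entries = f.read().split('@')  # crude split on entry markers

for entry in entries:
    year_match = re.search(r'year\s*=\s*"(\d{4})"', entry)
    if not year_match:
        continue
    year = int(year_match.group(1))
    fields = re.findall(r'(?:title|abstract)\s*=\s*"([^"]*)"', entry)
    totals[year] += 1
    if PATTERN.search(' '.join(fields)):
        nuanced[year] += 1

for year in sorted(y for y in totals if y >= 2000):
    print(year, round(nuanced[year] / totals[year], 4))
</code></pre></div></div>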

<div class="img-container">
<a href="/files/blog/nuance-fucks/total.png">
<img src="/files/blog/nuance-fucks/total.png" alt="Graph showing an increasing proportion of articles in the ACL anthology
         containing the words 'nuance' or 'fine-grain', with an explosion in the
         last 5 years." />
</a>
</div>

<p>However, the <code class="language-plaintext highlighter-rouge">bib</code> file has some issues – older articles are generally
missing their abstracts, so we only have their titles to search. <a href="https://github.com/acl-org/acl-anthology/issues/395" title="Extracting abstracts from PDF">Top people
are trying to fix the missing abstract field issue</a>, but until then,
we can recalculate the proportion of ‘nuanced’ articles by only considering
entries that have an <code class="language-plaintext highlighter-rouge">abstract</code> field in the bibliography. We observe the same
trend:</p>

<div class="img-container">
<a href="/files/blog/nuance-fucks/abs.png">
<img src="/files/blog/nuance-fucks/abs.png" alt="Another plot showing the increasing proportion of articles in the ACL
         anthology that contains 'nuance' or 'fine-grain', except here proportion
         is calculated only for articles that have an abstract in the Anthology." />
</a>
</div>
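<p>For this second calculation, the only change to the sketch above (reusing its <code class="language-plaintext highlighter-rouge">entries</code> and <code class="language-plaintext highlighter-rouge">PATTERN</code>, and the same assumption of double-quoted BibTeX fields) is a guard on the abstract field:</p>

<div class="language-python highlighter-rouge"><div class="highlight"><pre class="highlight"><code># Same tallies as before, but only over entries that carry an abstract
# field at all – mirroring the proportion shown in the second plot.
totals_abs, nuanced_abs = Counter(), Counter()

for entry in entries:
    year_match = re.search(r'year\s*=\s*"(\d{4})"', entry)
    if not year_match or not re.search(r'abstract\s*=\s*"', entry):
        continue
    year = int(year_match.group(1))
    fields = re.findall(r'(?:title|abstract)\s*=\s*"([^"]*)"', entry)
    totals_abs[year] += 1
    if PATTERN.search(' '.join(fields)):
        nuanced_abs[year] += 1
</code></pre></div></div>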

<p>I don’t think that all of these papers fall into the nuance traps
that Healy describes. Nor do I think that this trend of exploring
the subtleties and richness of language is bad for computational models and
theories of language – my <a href="https://www.mitpressjournals.org/doi/full/10.1162/tacl_a_00285" title="My TACL paper">first paper</a> is about exploring
nuance within the theory of generics! However, I do fear that I regularly fall
into the trap of valuing some research more because it is more nuanced.</p>

<!-- Links -->]]></content><author><name>Venkat</name></author><summary type="html"><![CDATA[I recently re-read Kieran Healy’s 2015 paper deriding the abundance of ‘nuance’ in U.S. Sociological research. I love this paper because it forced me to think about how I may fall into some of the ‘nuance traps’ that he identifies, especially the nuance of the fine-grain: First is the ever more detailed, merely empirical description of the world. This is the nuance of the fine-grain. Is there a similar phenomenon of nuance rising in Computational Linguistics? Anecdotally, I do see the words nuance and fine-grain more often in new papers and talks. There is a way to verify this — the ACL Anthology provides a bibliography of articles with their titles and abstracts dating back to the 1960s. Searching the bibliography using some simple regular expressions, we see that the proportion of articles that mention the words nuance or fine-grain in their titles or abstracts has indeed risen in the last 10 years (there were few mentions before 2000, so I’ve excluded them). However, the bib file has some issues – older articles are generally missing their abstracts, so we only have their titles to search. Top people are trying to fix the missing abstract field issue, but until then, we can recalculate the proportion of ‘nuanced’ articles by only considering entries that have an abstract field in the bibliography. We observe the same trend: I don’t think that all of these papers fall into the nuance traps that Healy describes. Nor do I think that this trend of exploring the subtleties and richness of language is bad for computational models and theories of language – my first paper is about exploring nuance within the theory of generics! However, I do fear that I regularly fall into the trap of valuing some research more because it is more nuanced.]]></summary></entry></feed>