Datamuse is a tiny software venture born in the early days of the Web and still stubbornly stuck there. We make brainstorming tools like RhymeZone, OneLook, and the Datamuse API, that collectively reach more than 10 million writers every month. (Here's a recap of some of the projects we've been working on during the past few years.)

We’re looking for contract programmers — computer science students or seasoned software developers — to help us improve our sites in 2022 and build entirely new ones. We hire contractors on a project basis and will consider longer-term engagements if there's mutual interest. Do you have experience in, or enthusiasm for, natural language processing, machine learning, or data visualization? Please read the project descriptions below, and if any of them pique your interest, give us a sense of your experience, time frame, and desired terms/rate in the feedback box at the bottom of this page. Thank you!


Research

OneLook Thesaurus language modeling: This project is suited for someone studying natural language processing and interested in applying their skills to improve a live Web application. You will experiment with using large language models such as GPT-3 to increase the quality and breadth of the results returned for certain kinds of queries made to OneLook Thesaurus. You will be responsible for implementing and evaluating your solutions with data resources that we provide. Project scope: 40+ hours over 3 months (open-ended); Desired skills: Proficiency in Python; experience with processing text data; machine learning basics.

OneLook Thesaurus internationalization: This project is suited for someone studying natural language processing and interested in applying their skills to improve a live Web application. You will be responsible for implementing and evaluating your solutions with data resources that we provide, and seeking out additional data resources in the target language. Project scope: 20+ hours over 2 months (open-ended); Desired skills: Proficiency in Python; experience with processing text data; machine learning basics; proficiency in Spanish, French, or Mandarin Chinese.

Spruce: We're working to make our Spruce brainstorming tool more comprehensive, more accurate, and faster. For this project you will work with our back-end semantic search and indexing code to scale Spruce to find connections between any piece of writing and hundreds of millions of documents. Desired skills: Proficiency in Python; some experience with large-scale data pipelines; data analysis experience.

Front-end development

Brainstorming viz: For this project, a web developer familiar with D3.js or similar visualization frameworks will implement a graphical output mode for OneLook Thesaurus wherein similar terms are grouped into clusters and related clusters are linked to each other in an interactive visualization. Project scope: 80 hours over 2 months (design + implementation of prototype). Desired skills: Proficiency in web design design and/or development; ReactJS.

Google Docs add-on: For this project, a developer familiar with Google Apps Script will improve the quality and reliability of our Google Docs add-on. Project scope: 40+ hours. Desired skills: Some familiarity with Google Apps Script and past experience deploying similar extensions or add-ons.

New microsite: For this project, a front-end web developer or designer will help us get a new website off the ground by designing, prototyping, and implementing an application around back-end APIs that we provide. Project scope: 40+ hours. Desired skills: Proficiency in web design design and/or development; ReactJS.

Infrastructure

Java performance tuning: We're looking for a Java server performance expert to help us reduce the latency of the Datamuse API. You will use tools such as CodeGuru to profile live and synthetic query loads, and implement and test changes to our API codebase accordingly. Project scope: 40 hours. Desired skills: Up-to-date Java expertise.

Self-hosted LLM: We're looking for a systems-oriented machine learning engineer to help us deploy large language models suited for our applications on our own cloud resources. This project will involve evaluating various open source distilled large language models on our own benchmarks and deploying them in our cloud, optimizing for a combination of accuracy, latency, and serving cost. Project scope: 40+ hours. Desired skills: Experience deploying large-scale machine learning models.



Do any of these contract projects pique your interest?
Please give us a sense of your experience, time frame, and desired terms in the feedback box below.




Email address:
(We'll only ever use this email address to send you a reply.)