Data Annotation Engineer at Veryfi, Inc. (W17)
$150K
APIs to Liberate Trapped Data in Unstructured Documents
San Mateo, CA, US / Medellín, Antioquia, CO / Remote (Medellín, Antioquia, CO; San Mateo, CA, US)
Full-time
Any (new grads ok)
About Veryfi, Inc.

Veryfi empowers organizations to transform unstructured data, in the form of receipts, invoices, purchase orders, checks, W2s and other business documents, into structured data at scale. Its suite of data transformation APIs supports many use cases in financial services and delivers business intelligence in seconds. Enterprises and technology companies worldwide rely on Veryfi’s AI-based platform.

Veryfi is backed by NewView Capital (NVC), Act One Ventures, TI Platform, Y Combinator and Zillionize.

Veryfi Raises $12 Million To Use AI To Tackle The Unstructured Data Entry Market https://www.forbes.com/sites/rebeccaszkutak/2021/04/26/veryfi-raises-12-million-to-use-ai-to-tackle-the-unstructured-data-entry-market/?sh=886fe19183f8

The Untapped Potential of Unstructured Data https://nvc.vc/perspectives/veryfi-the-untapped-potential-of-unstructured-data/

Capterra Reviews https://www.capterra.com/p/141684/Veryfi-Receipts-and-Expenses/reviews/

COME AND SAY G'DAY!

About the role
Skills: Machine Learning, Natural Language Processing, Data Warehousing

Veryfi is a YC-funded Silicon Valley startup that uses AI to understand documents like receipts and invoices. As a Data Annotation Engineer at Veryfi, you'll contribute to the evolution of our training data infrastructure and the development of new features and projects. You'll gather, process, and analyze diverse datasets to generate high-quality training data for our machine-learning models. By delving deep into our system, you'll also have the autonomy to identify challenges and opportunities and to take ownership of solutions that refine existing tools and algorithms.

Key Responsibilities:

  • Gather, process, and analyze diverse datasets to generate training data that fuels the development of our ML projects.
  • Expand and optimize the training data pipelines to improve the speed and accuracy of our processes.
  • Collaborate with a cross-functional team to define requirements and prioritize development efforts.

Essential Skills:

  • Proficient in Python programming for data handling and processing, with experience using data science tools such as Pandas, NumPy, and SciPy.
  • Strong analytical thinking with a focus on delivering results.
  • Meticulous attention to detail, ensuring accuracy and precision in all data handling and processing tasks.
  • Enthusiastic about learning and adapting to new technologies and methodologies, particularly in the realm of Machine Learning (ML).
  • Innovation mindset, adept at challenging existing processes and driving positive change.

Preferred Qualifications:

  • Familiarity with regex development, software engineering principles, and Linux command line tools.
  • Experience with Natural Language Processing (NLP) techniques and libraries, including the use of Large Language Models (LLMs) and supervised learning for document data extraction.
  • Effective organizational abilities, capable of managing projects independently from inception to completion.
  • Exceptional verbal and written communication skills, effectively communicating problems, proposed solutions, and results to stakeholders in a multicultural environment.

A Bachelor's degree in computer science, engineering, or a related field. Postgraduate studies are a plus but not required.

Keywords: NLP, Pattern Detection, Data Labeling, Software Development, Data Engineering.

Technology

(a) Native Mobile apps: Swift, Objective-C & Kotlin

(b) Backend: Python 3, TensorFlow, APIs on Django, Hub/Web on Flask

(c) IaaS: AWS with auto deploys to 4 geographies (read the deployment posts by Andrew here https://medium.com/the-road-to-silicon-valley)

(d) Database: Amazon Aurora

