Python Developer – AI & ETL Engineer with LangChain, Chroma, MongoDB

ASAP

Hanoi

Full Time

Overview

A results-driven Python Developer specializing in building AI-powered automation workflows using LangChain. Strong proficiency in document/email processing, ETL pipelines, MongoDB, Chroma, and data manipulation with Pandas. Bonus expertise in Salesforce plugin development with Java, enabling seamless integration with CRM workflows. Ideal for transforming unstructured communication into structured, actionable insights.

🛠️ Technical Skills

Languages & Frameworks

  • Python (Advanced – automation, AI pipelines, API development)
  • Java (for Salesforce plugin/custom integration)
  • FastAPI / Flask (API layer)
  • Pandas (ETL & data wrangling)

LLMs & AI Tools

  • LangChain (Chains, agents, tools, retrievers, RAG architecture)
  • OpenAI / Anthropic LLMs
  • Prompt Engineering
  • Chroma (Vector store – semantic search, metadata filtering)
  • Named Entity Recognition (NER), Summarization, Classification

Data & Storage

  • MongoDB (primary data store – structured/unstructured data, case management)
  • Chroma (vector search – for semantic document lookup)
  • Salesforce (data sync & plugin development via Java-based integrations)
  • JSON, YAML, and nested data structure manipulation

ETL & Data Pipelines

  • End-to-end pipeline building: extraction, transformation, and load
  • PDF, email, and file ingestion
  • Data hydration: merging case data with internal DBs, CRM, and APIs
  • Scheduled and trigger-based processing

⚙️ Responsibilities They Can Own

Email & Document Parsing:

  • Extract metadata, content, attachments from email servers or shared drives
  • Parse PDFs, Word docs, HTML, plain text using reliable libraries
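
As an illustration of the email parsing task above, here is a minimal sketch using only Python's standard `email` package. The helper name `parse_email`, the returned dictionary keys, and the sample message are assumptions made for this example; they do not describe an existing codebase.

```python
from email import message_from_bytes, policy
from email.message import EmailMessage


def parse_email(raw_bytes):
    """Return metadata, plain-text body, and attachment info for one raw message."""
    msg = message_from_bytes(raw_bytes, policy=policy.default)
    body_part = msg.get_body(preferencelist=("plain",))
    return {
        "subject": str(msg["subject"]),
        "from": str(msg["from"]),
        "body": body_part.get_content().strip() if body_part else "",
        "attachments": [
            {"filename": part.get_filename(), "size": len(part.get_content())}
            for part in msg.iter_attachments()
        ],
    }


# Hypothetical sample message, built in memory for illustration only.
sample = EmailMessage()
sample["Subject"] = "Water damage in unit 4B"
sample["From"] = "tenant@example.com"
sample.set_content("The ceiling is leaking.")
sample.add_attachment(
    b"%PDF-1.4 demo", maintype="application", subtype="pdf", filename="report.pdf"
)

parsed = parse_email(sample.as_bytes())
```

In practice the raw bytes would come from a mail server or a shared drive rather than from an in-memory message, and PDF or Word attachments would be handed to a dedicated parsing library.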

Case Generation Pipeline:

  • Transform raw unstructured inputs into structured, queryable "cases"
  • Use LLMs to classify, extract intent, and associate case metadata
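
A rough sketch of what a structured "case" could look like, assuming a simple dataclass and a pluggable classifier. The `Case` fields, the `build_case` helper, and the keyword-based `keyword_classifier` are all hypothetical; the classifier is only a stand-in for the LLM call, which is not shown here.

```python
from dataclasses import asdict, dataclass, field
from typing import Callable


@dataclass
class Case:
    source: str
    text: str
    category: str
    metadata: dict = field(default_factory=dict)


def build_case(source: str, text: str, classify: Callable[[str], str]) -> Case:
    """Turn one raw input into a structured case; `classify` stands in for an LLM call."""
    return Case(source=source, text=text.strip(), category=classify(text))


def keyword_classifier(text: str) -> str:
    # Placeholder for an LLM classification step; the categories are made up.
    lowered = text.lower()
    if "invoice" in lowered:
        return "billing"
    if "leak" in lowered or "broken" in lowered:
        return "maintenance"
    return "general"


case = build_case("email", "  The kitchen tap is broken.  ", keyword_classifier)
record = asdict(case)  # plain dict, ready to store as a document
```

Passing the classifier in as a function keeps the case structure independent of whichever model or prompt ends up doing the classification.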

Data Hydration & Enrichment:

  • Query MongoDB and external APIs (e.g., Salesforce) to enhance data completeness
  • Update or create Salesforce records via plugin interfaces if needed
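
The hydration step above can be sketched as a merge between a case and a matching CRM record. This is a simplified example on plain dictionaries: `hydrate_case`, the `email` join key, and the sample records are assumptions, and a real pipeline would fetch the records from MongoDB or Salesforce rather than from an in-memory dict.

```python
def hydrate_case(case, crm_records, key="email"):
    """Merge CRM fields into a case without overwriting values the case already has."""
    match = crm_records.get(case.get(key))
    if match is None:
        return {**case, "crm_matched": False}
    # Case values win over CRM values when both define the same field.
    enriched = {**match, **case}
    enriched["crm_matched"] = True
    return enriched


# Hypothetical CRM lookup table keyed by email address.
crm = {"tenant@example.com": {"account_id": "A-100", "name": "Example Tenant"}}

known = hydrate_case({"email": "tenant@example.com", "name": "E. Tenant"}, crm)
unknown = hydrate_case({"email": "nobody@example.com"}, crm)
```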

AI Workflow Orchestration (LangChain):

  • Implement step-based chains to analyze case data
  • Integrate Chroma vector searches for semantic enrichment
  • Use LLM reasoning to output decisions/actions

ETL Management:

  • Build robust pipelines with retry logic, logging, and monitoring
  • Optimize large-scale data transformations using Pandas
  • Handle schema changes and maintain backward compatibility
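
For the retry and logging requirement above, one possible shape is a small wrapper with exponential backoff. This is a sketch under assumptions: `with_retry`, its default values, and the `flaky_load` step are invented for the example and are not a prescribed design.

```python
import logging
import time

logger = logging.getLogger("etl")


def with_retry(step, attempts=3, base_delay=0.1):
    """Call step() up to `attempts` times, doubling the wait after each failure."""
    for attempt in range(1, attempts + 1):
        try:
            return step()
        except Exception as exc:
            logger.warning("attempt %d/%d failed: %s", attempt, attempts, exc)
            if attempt == attempts:
                raise  # out of attempts: surface the last error to the caller
            time.sleep(base_delay * 2 ** (attempt - 1))


# Hypothetical flaky step that succeeds on its third call.
calls = {"n": 0}


def flaky_load():
    calls["n"] += 1
    if calls["n"] < 3:
        raise ConnectionError("temporary outage")
    return "loaded"


result = with_retry(flaky_load, base_delay=0.01)
```

A production pipeline would usually retry only on errors known to be transient and would send the log records to a monitoring system.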

💼 Bonus: Salesforce Plugin Development

  • Java experience with Salesforce plugin/API development
  • Able to push/pull data to/from Salesforce for contextual case enrichment
  • Can work with Salesforce Events, Flows, or REST APIs from both Java and Python layers

Become part of our team and apply now in just a few minutes!

Mobile working and flexible working hours

You have the opportunity to work remotely or in the office.

Free drinks (coffee, water, etc.)

A variety of different drinks await you in our modern offices.

Corporate events

You are cordially invited to our team events throughout the year.

Modern work equipment

Powerful laptops and innovative software offer optimal working conditions for your success.

Training budget

We attach great importance to regular training and personal development opportunities.

30 vacation days

30 days of vacation offer you the perfect balance between professional demands and personal relaxation.

Ready to shape the future of property management with us?
We look forward to receiving your application and writing the next chapter of our success story together with you!

Please apply using our form or simply by email.

Elisabeth Rahi

HR & Marketing

Feel free to contact me if you have any further questions!