M47 Labs is an AI Engineering company focused on providing businesses with the latest and most advanced Artificial Intelligence solutions. Our deep expertise in AI-Language Technologies, including NLP and LLMs, empowers businesses with custom-built cutting-edge solutions. We are dedicated to creating AI-driven applications that can understand, interpret, and respond to human language. Join us in making the future more intelligent.
About you
You're an AI/ML Engineer experienced with a range of AI models, from big names like GPT, Anthropic's models, and PaLM to open-source models like LLaMA. You've fine-tuned LLaMA and GPT-3.5 for particular applications, and you have hands-on experience deploying these models on cloud platforms, specifically AWS and Azure.
Every day brings a lot of new information in AI, from research papers to models, and it's hard to keep up. You've used tools like LangChain, LlamaIndex, and Pinecone to help you manage this. With more JavaScript developers building AI applications, you also know about newer tools like LangChain.js, Transformers.js, and Vercel’s AI SDK.
You understand the challenges and possibilities that LLMs bring to the table and are passionate about harnessing their potential in end-to-end solutions. If spearheading projects that push the boundaries of what LLMs can achieve in a full-stack environment excites you, we are eager to co-create the future together.
About your day-to-day:
- Implement and integrate large language models into web-based applications and other platforms, ensuring optimal performance and user experience. 
- Critically evaluate and select the best third-party AI models and LLMs for each solution.
- Fine-tune language models for specific applications, ensuring they align with project goals. 
- Monitor the performance of integrated models, watching for latency issues and ensuring user queries receive timely and accurate responses.
- Handle large datasets, ensuring proper storage, access, and processing capabilities, especially when training or refining models. 
- Implement optimization techniques to ensure efficient model inference, reducing computational costs and improving response times. 
- Design and implement solutions that can scale to handle a high number of simultaneous user queries without degrading performance. 
- Collaborate with Back-End Engineers to ensure the infrastructure supports the demands of the language model, especially concerning GPU resources, memory, and storage. 
- Stay updated with advancements in the field of large language models, bringing in new techniques and practices to improve integration and user experience. 
What is in it for you?
💪🏽 Indefinite full-time contract
☀️ Office located at the heart of Barcelona
🚀 Follow your career ambition with growth opportunities (horizontal and vertical)
📚 On-demand learning budget and ongoing company-wide training on topics relevant to our industry
💸 Comprehensive compensation package, including private medical insurance coverage and flexible remuneration through Cobee including meals, gym pass, transport and kindergarten.
🌈 Be part of our diverse communities and enjoy our meetups (Women in Tech, LGBTQ+, Wellbeing, City Lifestyle...)
🌍 Great international, inclusive and dynamic work environment (more than 20 nationalities!)