Odixcity Consulting is a Nigerian HR consulting and procurement firm that provides business solutions to companies, entrepreneurs, and SMEs. We offer a wide range of services including recruitment, performance management, training and development, compensation and benefits, payroll and benefits administration, and procurement of goods and services.
Job description:
We are seeking a technically skilled Multimodal Specialist (Vision / Audio / Video) to evaluate and validate annotations across image, video, and audio datasets. This role plays a key part in ensuring the accuracy and consistency of multimodal training data and model outputs.
Key Responsibilities:
- Review and validate image, video, and audio annotations
- Assess bounding boxes, segmentation masks, and object labeling accuracy
- Perform image segmentation QA and detect spatial inconsistencies
- Validate video events, temporal sequences, and frame-level annotations
- Conduct audio transcription QA and verify timestamp accuracy
- Score multimodal model outputs for correctness and quality
- Identify labeling inconsistencies, noise, and structural errors
- Provide structured feedback to improve annotation standards
Required Skills & Qualifications:
- Bachelor’s degree in Computer Science, Information Technology, or equivalent professional experience.
- 4+ years of experience working with vision, audio, or video datasets
- Familiarity with annotation tools (e.g., labeling platforms for bounding boxes, segmentation, transcription)
- Strong spatial and temporal reasoning skills
- High attention to detail and consistency in evaluation
- Ability to analyze large-scale multimodal datasets
Method of Application
Meet the Qualifications? Email your CVs and cover letters to [email protected] using the Job Title as the subject.