Government Tender for Development of Indigenous Gujarati Large Language Model (LLM)
INVITING EXPRESSION OF INTEREST (EOI) FOR SUBMISSION OF PROPOSAL ON DESIGN AND DEVELOPMENT OF INDIGENOUS LARGE LANGUAGE MODEL (LLM) FOR GUJARATI LANGAUGE
Tender Details
- GUJARAT INFORMATICS LIMITED, Block No. 2, 2nd Floor, C & D Wing, Karmayogi Bhavan Sector - 10 A, Gandhinagar
- Submission Start:June 2, 2025 at 02:30 PM
- Document Download End:June 25, 2025 at 03:00 PM
- dgmapp-gil@gujarat.gov.in
This government tender by Gujarat Informatics Limited seeks proposals from qualified IT and AI firms for the design and development of an indigenous Gujarati Large Language Model (LLM). The project aims to advance local language processing capabilities, supporting applications such as translation, voice recognition, and content generation. The initiative aligns with the government's vision to promote digital inclusivity and linguistic diversity. Bidders are expected to demonstrate expertise in transformer-based NLP models, data handling, and scalable AI solutions. The tender process is open, with detailed eligibility and technical criteria outlined in the official document. The contract includes development, deployment, and ongoing support, fostering innovation in regional language technology. Interested organizations should review all guidelines carefully and submit their proposals before the deadline to participate in this strategic project enhancing Gujarat’s digital infrastructure.
Overview
This government tender invites qualified IT and AI development firms to submit proposals for the design and development of an indigenous Large Language Model (LLM) tailored specifically for the Gujarati language. Organized by Gujarat Informatics Limited, a government enterprise, this project aims to enhance language processing capabilities and promote local language technology solutions. The tender targets organizations with expertise in natural language processing, machine learning, and AI development, particularly those experienced in creating large-scale language models. The project scope includes designing, training, and deploying a robust Gujarati LLM that can support various applications such as translation, voice recognition, and content generation. The initiative aligns with the government's objective to promote digital inclusivity and linguistic diversity. The tender process is open, with detailed eligibility criteria and technical specifications provided. Bidders are encouraged to review the comprehensive tender document linked below for detailed instructions and submission guidelines.
Scope of Work
• Design and develop a large-scale Gujarati language model capable of understanding and generating natural language content.
• Data collection and preprocessing: Gather diverse Gujarati language datasets, including textual, spoken, and multimedia sources.
• Model training: Utilize state-of-the-art NLP techniques and machine learning frameworks to train the LLM.
• Evaluation and validation: Conduct rigorous testing to ensure accuracy, contextual understanding, and language fluency.
• Deployment: Implement the LLM into accessible platforms and applications.
• Support and maintenance: Provide ongoing support, updates, and improvements post-deployment.
Specification | Details |
---|---|
Language Focus | Gujarati |
Model Type | Transformer-based large language model |
Data Sources | Text corpora, voice data, multimedia |
Training Environment | High-performance computing infrastructure |
Delivery Timeline | As per tender document attached |
Eligibility Criteria
✓ Must have proven experience in developing large language models or NLP solutions.
✓ Must possess relevant certifications and technical qualifications.
✓ Must demonstrate successful completion of similar projects.
✓ Bidders should have a valid Digital Signature Certificate (DSC) as per IT Act 2000.
Qualification Requirement | Details |
---|---|
Experience | Minimum 3 years in NLP/AI projects |
Certification | Valid Digital Signature Certificate (DSC) |
Financial Capacity | Annual turnover of at least ₹5 crore in the last 3 years |
Technical Capability | Proven expertise in transformer models, machine learning, and data processing |
Technical Specifications
The proposed Gujarati LLM should adhere to the following technical parameters:
Parameter | Specification |
---|---|
Model Architecture | Transformer-based, scalable |
Data Volume | Minimum 10 million Gujarati text samples |
Training Hardware | GPUs with at least 32 GB VRAM |
Model Size | Not less than 1 billion parameters |
Output | Fluent, contextually accurate Gujarati language generation |
Financial Requirements
The estimated budget for this project ranges between ₹50 lakh to ₹1 crore, covering data acquisition, model development, training infrastructure, and deployment. The detailed cost breakdown is as follows:
Cost Component | Estimated Cost |
---|---|
Data Collection & Processing | ₹10 lakh |
Model Development & Training | ₹30 lakh |
Hardware & Infrastructure | ₹15 lakh |
Deployment & Support | ₹5 lakh |
Bidding Process
- Interested bidders must download the tender document from the official portal.
- Submit the technical and financial bids electronically before the deadline.
- Ensure all required documents and certificates are uploaded.
- Bidders may attend a pre-bid meeting scheduled as per the timeline.
- The evaluation process will commence immediately after bid submission closes.
Timeline | Date & Time |
---|---|
Bid Submission Start | 02-06-2025 14:30 |
Bid Submission End | 25-06-2025 15:00 |
Evaluation & Shortlisting | 26-06-2025 to 30-06-2025 |
Final Selection & Award | 01-07-2025 |
Evaluation Criteria
Bids will be evaluated based on the following criteria:
Criteria | Weightage |
---|---|
Technical expertise and experience | 40% |
Proposed methodology and innovation | 30% |
Cost competitiveness | 20% |
Past project success and references | 10% |
Important Dates
Date | Event | Details |
---|---|---|
02-06-2025 | Tender Publication | Tender available for download from official portal |
25-06-2025 | Bid Submission Deadline | Last date for bid submission at 15:00 |
26-06-2025 | Evaluation Commencement | Evaluation process begins |
01-07-2025 | Award Notification | Contract awarded to successful bidder |
05-07-2025 | Project Commencement | Expected start date for project execution |
Contact Information
For queries and clarifications, bidders can contact:
Contact Person | Designation | Contact Details |
---|---|---|
DGM (App) | Gujarat Informatics Limited | Phone: 07923255950 |
dgmapp-gil@gujarat.gov.in |
Additional Information
This tender is part of the government’s initiative to promote local language technology solutions and digital inclusivity. Bidders are advised to review the detailed tender document linked below for comprehensive instructions, terms, and conditions. All submissions must adhere strictly to the specified formats and deadlines. The project aims to foster innovation in natural language processing for regional languages, with a focus on Gujarat's linguistic diversity. Participation from organizations with proven expertise in AI, NLP, and large language model development is highly encouraged. Ensure compliance with all eligibility and technical requirements to enhance your chances of selection.
Industry & Sector Information
Primary Industry
Sectors
Related Industries
Target Audience
General Information
Financial Information
Evaluation and Technical Information
Tender Documents
1 DocumentTender Stages
Tender Stages
stage name | evaluation date | minimum forms for submission |
---|---|---|
Eligibility Criteria and proposal from the bidder | 25-06-2025 15:30 | 0 |
Similar Tenders
Frequently Asked Questions
The primary goal is to develop an indigenous Gujarati Large Language Model (LLM) to enhance language processing capabilities, support local language applications, and promote digital inclusivity for Gujarat. This includes designing, training, and deploying a robust NLP solution tailored for the Gujarati language.
Eligible participants include IT companies, AI and NLP research organizations, technology firms with experience in large language models, and institutions holding valid Digital Signature Certificates (DSC). Bidders must demonstrate relevant experience and technical expertise in NLP, machine learning, and large-scale model development.
The model should be transformer-based, scalable, trained on at least 10 million Gujarati text samples, and capable of generating fluent language content. Hardware requirements include GPUs with minimum 32 GB VRAM, and the model size should be not less than 1 billion parameters. The model must support various NLP applications such as translation and voice recognition.
Bidders must download the tender document, submit their technical and financial proposals electronically before 25th June 2025, 15:00, and ensure all documents are complete. The evaluation begins immediately afterward, with the project expected to start by 5th July 2025. Detailed timelines are available in the tender notice.
Bids will be assessed based on technical expertise (40%), methodology and innovation (30%), cost (20%), and past project success (10%). A minimum of 70% in technical evaluation is required to qualify for the final selection. The highest-scoring compliant bid will be awarded the contract.
Probable Bidders
Get Tender Alerts
Get notifications for similar tenders