Government Tender for Development of Indigenous Gujarati Large Language Model (LLM)

INVITING EXPRESSION OF INTEREST (EOI) FOR SUBMISSION OF PROPOSAL ON DESIGN AND DEVELOPMENT OF INDIGENOUS LARGE LANGUAGE MODEL (LLM) FOR GUJARATI LANGAUGE

Type: Works
Period: As per tender Document Attached

Tender Details

  • GUJARAT INFORMATICS LIMITED, Block No. 2, 2nd Floor, C & D Wing, Karmayogi Bhavan Sector - 10 A, Gandhinagar
  • Submission Start:June 2, 2025 at 02:30 PM
  • Document Download End:June 25, 2025 at 03:00 PM
  • dgmapp-gil@gujarat.gov.in
Industry: Information Technology and Artificial Intelligence
Category: Works
Type: Works
Period: As per tender Document Attached

This government tender by Gujarat Informatics Limited seeks proposals from qualified IT and AI firms for the design and development of an indigenous Gujarati Large Language Model (LLM). The project aims to advance local language processing capabilities, supporting applications such as translation, voice recognition, and content generation. The initiative aligns with the government's vision to promote digital inclusivity and linguistic diversity. Bidders are expected to demonstrate expertise in transformer-based NLP models, data handling, and scalable AI solutions. The tender process is open, with detailed eligibility and technical criteria outlined in the official document. The contract includes development, deployment, and ongoing support, fostering innovation in regional language technology. Interested organizations should review all guidelines carefully and submit their proposals before the deadline to participate in this strategic project enhancing Gujarat’s digital infrastructure.

Overview

This government tender invites qualified IT and AI development firms to submit proposals for the design and development of an indigenous Large Language Model (LLM) tailored specifically for the Gujarati language. Organized by Gujarat Informatics Limited, a government enterprise, this project aims to enhance language processing capabilities and promote local language technology solutions. The tender targets organizations with expertise in natural language processing, machine learning, and AI development, particularly those experienced in creating large-scale language models. The project scope includes designing, training, and deploying a robust Gujarati LLM that can support various applications such as translation, voice recognition, and content generation. The initiative aligns with the government's objective to promote digital inclusivity and linguistic diversity. The tender process is open, with detailed eligibility criteria and technical specifications provided. Bidders are encouraged to review the comprehensive tender document linked below for detailed instructions and submission guidelines.

Scope of Work

Design and develop a large-scale Gujarati language model capable of understanding and generating natural language content.
Data collection and preprocessing: Gather diverse Gujarati language datasets, including textual, spoken, and multimedia sources.
Model training: Utilize state-of-the-art NLP techniques and machine learning frameworks to train the LLM.
Evaluation and validation: Conduct rigorous testing to ensure accuracy, contextual understanding, and language fluency.
Deployment: Implement the LLM into accessible platforms and applications.
Support and maintenance: Provide ongoing support, updates, and improvements post-deployment.

SpecificationDetails
Language FocusGujarati
Model TypeTransformer-based large language model
Data SourcesText corpora, voice data, multimedia
Training EnvironmentHigh-performance computing infrastructure
Delivery TimelineAs per tender document attached
1. Submit detailed technical proposal. 2. Demonstrate relevant experience and technical capability. 3. Provide a project implementation plan. 4. Ensure compliance with data privacy and security standards. 5. Complete the project within the stipulated completion period.

Eligibility Criteria

✓ Must have proven experience in developing large language models or NLP solutions.
✓ Must possess relevant certifications and technical qualifications.
✓ Must demonstrate successful completion of similar projects.
✓ Bidders should have a valid Digital Signature Certificate (DSC) as per IT Act 2000.

Qualification RequirementDetails
ExperienceMinimum 3 years in NLP/AI projects
CertificationValid Digital Signature Certificate (DSC)
Financial CapacityAnnual turnover of at least ₹5 crore in the last 3 years
Technical CapabilityProven expertise in transformer models, machine learning, and data processing
**Important:** Bidders must attach scanned copies of all relevant certificates and documents as specified in the tender document.

Technical Specifications

The proposed Gujarati LLM should adhere to the following technical parameters:

ParameterSpecification
Model ArchitectureTransformer-based, scalable
Data VolumeMinimum 10 million Gujarati text samples
Training HardwareGPUs with at least 32 GB VRAM
Model SizeNot less than 1 billion parameters
OutputFluent, contextually accurate Gujarati language generation
Sequential process: 1. Data collection and cleaning. 2. Model architecture design. 3. Model training and tuning. 4. Validation and testing. 5. Deployment and integration.

Financial Requirements

The estimated budget for this project ranges between ₹50 lakh to ₹1 crore, covering data acquisition, model development, training infrastructure, and deployment. The detailed cost breakdown is as follows:

Cost ComponentEstimated Cost
Data Collection & Processing₹10 lakh
Model Development & Training₹30 lakh
Hardware & Infrastructure₹15 lakh
Deployment & Support₹5 lakh
Payment terms are as follows: - 30% advance upon contract signing. - 40% after completion of model training. - Remaining 30% post successful deployment and acceptance. All payments are subject to submission of progress reports and deliverables as per the project milestones.

Bidding Process

  1. Interested bidders must download the tender document from the official portal.
  2. Submit the technical and financial bids electronically before the deadline.
  3. Ensure all required documents and certificates are uploaded.
  4. Bidders may attend a pre-bid meeting scheduled as per the timeline.
  5. The evaluation process will commence immediately after bid submission closes.
TimelineDate & Time
Bid Submission Start02-06-2025 14:30
Bid Submission End25-06-2025 15:00
Evaluation & Shortlisting26-06-2025 to 30-06-2025
Final Selection & Award01-07-2025
**Important:** Bidders must ensure compliance with all submission guidelines and deadlines to avoid disqualification.

Evaluation Criteria

Bids will be evaluated based on the following criteria:

CriteriaWeightage
Technical expertise and experience40%
Proposed methodology and innovation30%
Cost competitiveness20%
Past project success and references10%
Minimum qualifying marks for technical evaluation: 70%. The highest-scoring bid will be awarded the contract. The evaluation panel will consider the bidder’s experience, technical approach, compliance with specifications, and financial proposal. Bidders must meet all eligibility and technical criteria to qualify for the award.

Important Dates

DateEventDetails
02-06-2025Tender PublicationTender available for download from official portal
25-06-2025Bid Submission DeadlineLast date for bid submission at 15:00
26-06-2025Evaluation CommencementEvaluation process begins
01-07-2025Award NotificationContract awarded to successful bidder
05-07-2025Project CommencementExpected start date for project execution
**Note:** All dates are subject to change; bidders are advised to regularly check official updates.

Contact Information

For queries and clarifications, bidders can contact:

Contact PersonDesignationContact Details
DGM (App)Gujarat Informatics LimitedPhone: 07923255950
Emaildgmapp-gil@gujarat.gov.in
Address: GUJARAT INFORMATICS LIMITED, Block No. 2, 2nd Floor, C & D Wing, Karmayogi Bhavan Sector - 10 A, Gandhinagar, Gujarat. Bidders are encouraged to seek assistance during the vendor training sessions or via email for technical support related to bid submission.

Additional Information

This tender is part of the government’s initiative to promote local language technology solutions and digital inclusivity. Bidders are advised to review the detailed tender document linked below for comprehensive instructions, terms, and conditions. All submissions must adhere strictly to the specified formats and deadlines. The project aims to foster innovation in natural language processing for regional languages, with a focus on Gujarat's linguistic diversity. Participation from organizations with proven expertise in AI, NLP, and large language model development is highly encouraged. Ensure compliance with all eligibility and technical requirements to enhance your chances of selection.

Industry & Sector Information

Primary Industry

Information Technology and Artificial Intelligence

Sectors

State Governments & UT

Related Industries

Natural Language Processing
Machine Learning
Language Technology

Target Audience

AI and NLP technology companies
Large language model developers
Research organizations specializing in regional languages
IT firms with experience in machine learning
Language technology startups

General Information

Item Category
Information Technology (IT)
State

Financial Information

Bid Offer Validity
180 Days

Evaluation and Technical Information

Inspection Required
No

Tender Documents

1 Document
Draft_EoI_Gujarati LLM_02.06.2025 V2.pdfTENDER_DOCUMENT

Tender Stages

Tender Stages

stage nameevaluation dateminimum forms for submission
Eligibility Criteria and proposal from the bidder25-06-2025 15:300

Similar Tenders

View All

Frequently Asked Questions

The primary goal is to develop an indigenous Gujarati Large Language Model (LLM) to enhance language processing capabilities, support local language applications, and promote digital inclusivity for Gujarat. This includes designing, training, and deploying a robust NLP solution tailored for the Gujarati language.

Eligible participants include IT companies, AI and NLP research organizations, technology firms with experience in large language models, and institutions holding valid Digital Signature Certificates (DSC). Bidders must demonstrate relevant experience and technical expertise in NLP, machine learning, and large-scale model development.

The model should be transformer-based, scalable, trained on at least 10 million Gujarati text samples, and capable of generating fluent language content. Hardware requirements include GPUs with minimum 32 GB VRAM, and the model size should be not less than 1 billion parameters. The model must support various NLP applications such as translation and voice recognition.

Bidders must download the tender document, submit their technical and financial proposals electronically before 25th June 2025, 15:00, and ensure all documents are complete. The evaluation begins immediately afterward, with the project expected to start by 5th July 2025. Detailed timelines are available in the tender notice.

Bids will be assessed based on technical expertise (40%), methodology and innovation (30%), cost (20%), and past project success (10%). A minimum of 70% in technical evaluation is required to qualify for the final selection. The highest-scoring compliant bid will be awarded the contract.

Probable Bidders

Companies likely to bid

Get Tender Alerts

Get notifications for similar tenders