weniweni.com

New Arrivals/Restock

Evaluating AI Systems: Testing LLMs, RAG, and Agents Kindle Edition

4.3 (20 items)

Limited Time Sale

New $9.99 (tax included) In stock: 1

Used $4.00 (tax included) New Arrivals and Restocks In stock: 1

The used price is $5.99 cheaper than the new price.

Free shipping on purchases of $99 or more
Free cash-on-delivery fees for purchases over $99

Other shops (12): from $99

Please note that the sales price and tax displayed may differ between online and in-store. Also, the product may be out of stock in-store.

Product details

Management number	220491396
Release Date	2026/05/03
List Price	$4.00
Model Number	220491396
Category	Kindle Store > Kindle eBooks > Computers & Technology > Programming > Software Design, Testing & Engineering > Software Development

The definitive guide to testing AI systems that actually work.

Most AI systems ship without meaningful evaluation. Teams eyeball a few responses, declare the system "good enough," and push to production. Then quality degrades, hallucinations appear, and nobody knows why.

Evaluating AI Systems is a practical, technical guide to building evaluation frameworks for LLMs, RAG pipelines, and AI agents. Written by Alex Merced, Head of Developer Relations at Dremio and author of multiple technical books, it covers the full evaluation lifecycle from dataset generation to production monitoring.

What you will learn:
Understand why traditional software testing fails for AI and what to do instead
Build golden evaluation datasets that accurately measure system quality
Implement prompt testing with tools like DeepEval, RAGAS, and promptfoo
Design evaluation metrics for correctness, faithfulness, relevance, and safety
Detect and measure hallucinations with automated pipelines
Use LLM-as-judge patterns with bias mitigation and multi-model consensus
Build regression testing that catches quality degradation before users do
Deploy production monitoring with drift detection and quality alerting
Evaluate multi-step agent workflows with tool use accuracy metrics
Manage evaluation costs with tiered strategies from smoke tests to deep expert reviews

Written with verified specifications for GPT-5.4, Claude Sonnet 4.6, and Gemini 3.1 Pro throughout. Every technique is immediately applicable to production AI systems.

For AI engineers: Build evaluation pipelines that prevent quality incidents.
For QA engineers: Apply testing discipline to the most untestable systems you have ever worked with.
For engineering managers: Make informed quality decisions with data, not gut feeling.

XRay	Not Enabled
Edition	1st
Language	English
File size	10.3 MB
Page Flip	Enabled
Publisher	Alex Merced Books
Word Wise	Not Enabled
Print length	372 pages
Accessibility	Learn more
Screen Reader	Supported
Publication date	March 16, 2026
Enhanced typesetting	Enabled

Correction of product information

If you notice any omissions or errors in the product information on this page, please use the correction request form below.

Correction Request Form

Product Review

4.3 ( 20 items )

	15 items
	5 items
	2 items
	1 item
	0 items

There are currently no product reviews.

Shipping Rates

Order Amount	Shipping Fee	Handling Fee
Under $99	$12.99	$24.00
$99 - $499	FREE	$24.00
$500 and above	FREE	FREE

Delivery Time

Standard Shipping: 5-7 business days
Express Shipping: 2-3 business days (additional $15)
Overnight Shipping: Next business day (additional $35)

Available Regions

We ship to all 50 US states, Canada, and select international destinations through our partner Neokyo.
